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MOTHOnS FOR TREATING MUSCLE DISEASES AND DISORDERS 

Background gf tire invention 
The invention relates to prophylactic or affirmative 
5 treatment of diseases and disorders of the musculature by 
administering polypeptides found in vertebrate species, 
which polypeptides are growth , differentiation and survival 
factors for muscle cells* 

Muscle tissue in adult vertebrates will regenerate 

10 from reserve myoblasts called satellite cells* Satellite 
cells are distributed throughout muscle tissue and are 
mitotically quiescent in the absence of injury or disease* 
Following muscle injury or during recovery from disease, 
satellite cells will reenter the cell cycle, proliferate and 

15 1) enter existing muscle fibers or 2) undergo 

differentiation into multinucleate myotubes which form new 
muscle fiber. The myoblasts ultimately yield replacement 
muscle fibers or fuse into existing muscle fibers, thereby 
increasing fiber girth by the synthesis of contractile 

20 apparatus components. This process is illustrated, for 

example, by the nearly complete regeneration which occurs in 
mammals following induced muscle fiber degeneration; the 
muscle progenitor cells proliferate and fuse together 
regenerating muscle fibers. 

25 Several growth factors which regulate the 

proliferation and differentiation of adult (and embryonic) 
myoblasts in vitro have been identified. Fibroblast growth 
factor (FGF) is mitogenic for muscle cells and is an 
inhibitor of muscle differentiation. Transforming growth 

30 factor fi (TGF/0 has no effect on myoblast proliferation, but 
is an inhibitor of muscle differentiation. Insulin-like 
growth factors (IGFs) have been shown to stimulate both 
myoblast proliferation and differentiation in rodents. 
Platelet derived growth factor (PDGF) is also mitogenic for 

35 myoblasts and is a potent inhibitor of muscle cell 



differentiation see: Florini and Magri, 1989: 256 :C701- 
C711) . 

In vertebrate species both muscle tissue and 
neurons are potential sources of factors which stimulate 
5 myoblast proliferation and differentiation. In diseases 
affecting the neuromuscular system which are neural in 
origin (i.e., neurogenic), the muscle tissue innervated 
by the affected nerve becomes paralyzed and wastes 
progressively. During peripheral nerve regeneration and 

10 recovery from neurologic and myopathic disease, neurons 
may provide a source of growth factors which elicit the 
muscle regeneration described above and provide a 
mechanism for muscle recovery from wasting and atrophy* 

A recently described family of growth factors, the 

15 neuregulins, are synthesized by motor neurons (Marchioni 
et al. Nature 362:313, 1993) and inflammatory cells 
(Tarafchovaky et al., Oncogene 6:2187-2196 (1991)). The 
neuregulins and related pl85* rbB2 binding factors have 
been purified, cloned and expressed (Benveniste et al., 

20 PNAS £2:3930*3934, 1985; Rimura et al., Nature 

141:257-260, 1990; Davis and Stroobant, J. Cell. Biol. 
Hfi:1353-1360 # 1990; Wen et al., Cell ££:559, 1992; 
Yarden and Ullrich, Ann. Rev. Biochem. £2:443, 1988; 
Holmes et al., Science 25£:1205, 1992; Dobashi et al., 

25 Proc. Natl. Acad. Sci. Aft: 8582, 1991; Lupu et al., Proc. 
Natl. Acad. Sci. £2:2287, 1992). Recombinant neuregulins 
have been shown to be mitogenic for peripheral glia 
(Karchionni et al.. Nature 362:313, 1993) and have been 
shown to influence the formation of the neuromuscular 

30 junction (Falls et al., Cell 72:801, 1993). Thus the 

regenerating neuron and the inflammatory cells associated 
with the recovery from neurogenic disease and nerve 
injury provide a source of factors which coordinate the 
remyelination of motor neurons and their ability to form 

35 the appropriate connection with their target. After 
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muscle has been reinnervat d th m t r neuron may provid 
factors to muscle, stimulating muscle growth and 
survival. 

Currently, there is no useful therapy for the 
5 promotion of muscle differentiation and survival . Such a 
therapy would be useful for treatment of a variety of 
neural and muscular diseases and disorders. 

*Vm mir Y nt * h « Invention 
We have discovered that increased mitogenesis 

10 differentiation and survival of muscle cells may be 
achieved using proteins heretofore described as glial 
growth factors, acetylcholine receptor inducing activity 
(ARIA), heregulins, neu differentiation factor, and, more 
generally, neuregulins. We have discovered that these 

15 compounds are capable of inducing both the proliferation 
of muscle cells and the differentiation and survival of 
myotubes. These phenomena may occur in cardiac and 
smooth muscle tissues in addition to skeletal muscle 
tissues. Thus, the above compounds, regulatory compounds 

20 which induce synthesis of these compounds, and small 

molecules which mimic these compounds by binding to the 
receptors on muscle or by stimulating through other means 
the second messenger systems activated by the ligand- 
receptor complex are all extremely useful as prophylactic 

25 and affirmative therapies for muscle diseases. 

A novel aspect of the invention involves the use 
of the above named proteins as growth factors to induce 
the mitogenesis, survival, growth and differentiation of 
muscle cells. Treating of the muscle cells to achieve 

30 these effects may be achieved by contacting muscle cells 
with a polypeptide described herein. The treatments may 
be provided to slow or halt net muscle loss or to 
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increase the amount or quality f muscle pr v nt in the 
vertebrate. 

These factors may be used to produce muscle cell 
mitogenesis, differentiation, and survival in a 
5 vertebrate (preferably a mammal, more preferably a human) 
by administering to the vertebrate an effective amount of 
a polypeptide or a related compound. Neuregulin effects 
on muscle may occur, for example, by causing an increase 
in muscle performance by inducing the synthesis of 

10 particular isoforms of the contractile apparatus such as 
the myosin heavy chain slow and fast isoforms; by 
promoting muscle fiber survival via the induction of 
synthesis of protective molecules such as, but not 
limited to, dystrophin; and/or by increasing muscle 

15 innervation by, for example, increasing acetylcholine 
receptor molecules at the neuromuscular junction. 

The term muscle cell as used herein refers to any 
cell which contributes to muscle tissue. Myoblasts, 
satellite cells, myotubes, and myofibril tissues are all 

20 included in the term H muscle cells" and may all be 

treated using the methods of the invention. Muscle cell 
effects may be induced within skeletal, cardiac and 
smooth muscles. 

Mitogenesis may be induced in muscle cells, 

25 including myoblasts or satellite cells, of skeletal 

muscle, smooth muscle or cardiac muscle. Mitogenesis as 
used herein refers to any cell division which results in 
the production of new muscle cells in the patient. More 
specifically, mitogenesis in vitro is defined as an 

30 increase in mitotic index relative to untreated cells of 
50%, more preferably 100%, and most preferably 300%, when 
the cells are exposed to labelling agent for a time 
equivalent to two doubling times. The mitotic index is 
the fraction of cells in the culture which have labelled 
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nuclei when grown in the presence of a tracer which only 
incorporates during S phase (i.e., BrdU) and the doubling 
time is defined as the average time required for the 
number of cells in the culture to increase by a factor of 
5 two) . 

An effect on mitogenesis in vivo is defined as an 
increase in satellite cell activation as measured by the 
appearance of labelled satellite cells in the muscle 
tissue of a mammal exposed to a tracer which only 

10 incorporates during S phase (i.e., BrdU). In useful 
therapeutic is defined in vivo as a compound which 
increases satellite cell activation relative to a control 
mammal by at least 10% r more preferably by at least 50%, 
and most preferably by more than 200% when the mammal is 

15 exposed to labelling agent for a period of greater than 
15 minutes and tissues are assayed between 10 hours and 
24 hours after administration of the mitogen at the 
therapeutic dose. Alternatively, satellite cell 
activation in vivo may be detected by monitoring the 

20 appearance of the intermediate filament vimentin by 

immunological or RNA analysis methods. When vimentin is 
assayed, the useful mitogen is defined as one which 
causes expression of detectable levels of vimentin in 
the muscle tissue when the therapeutically useful dosage 

25 is provided. 

Myogenesis as used herein refers to any fusion of 
myoblasts to yield myotubes. Most preferably, an effect 
on myogenesis is defined as an increase in the fusion of 
myoblasts and the enablement of the muscle 

30 differentiation program. The useful myogenic therapeutic 
is defined as a compound which confers any increase in 
the fusion index in vitro. More preferably, the compound 
confers at least a 2.0-fold increase and, most 
preferably, the compound confers a 3 -fold or greater 
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increase in the fusion index relative to the control. 
The fusion index is defined as the fraction of nuclei 
present in multinucleated cells in the culture relative 
to the total number of nuclei present in the culture. 
5 The percentages provided above are for cells assayed 

after 6 days of exposure to the myogenic compound and are 
relative to an untreated control. Myogenesis may also be 
determined by assaying the number of nuclei per area in 
myotubes or by measurement of the levels of muscle 

10 specific protein by Western analysis. Preferably, the 
compound confers at least a 2.0-fold increase in the 
density of myotubes using the assay provided, for 
example, herein, and, most preferably, the compound 
confers a 3 -fold or greater increase. 

15 The growth of muscle may occur by the increase in 

the fiber size and/or by increasing the number of fibers. 
The growth of muscle as used herein may be measured by A) 
an increase in wet weight, B) an increase in protein 
content, C) an increase in the number of muscle fibers, 

20 or D) an increase in muscle fiber diameter. An increase 
in growth of a muscle fiber can be defined as an increase 
in the diameter where the diameter is defined as the 
minor axis of ellipsis of the cross section. The useful 
therapeutic is one which increases the wet weight, 

25 protein content and/or diameter by 10% or more, more 
preferably by more than 50% and most preferably by more 
than 100% in an animal whose muscles have been previously 
degenerated by at least 10% and relative to a similarly 
treated control animal (i.e., an animal with degenerated 

30 muscle tissue which is not treated with the muscle growth 
compound). A compound which increases growth by 
increasing the number of muscle fibers is useful as a 
therapeutic when it increases the number of fibers in the 
diseased tissue by at least 1%, more preferably at least 
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20%, and most pr ferably, by at least 50%. These 
percentages are determined relative to the basal level in 
a comparable untreated undiseased mammal or in the 
contralateral undiseased muscle when the compound is 
5 administered and acts locally. 

The survival of muscle fibers as used herein 
refers to the prevention of loss of muscle fibers as 
evidenced by necrosis or apoptosis or the prevention of 
other mechanisms of muscle fiber loss. Survival as used 

10 herein indicates an decrease in the rate of cell death of 
at least 10%, more preferably by at least 50%, and most 
preferably by at least 300% relative to an untreated 
control. The rate of survival may be measured by 
counting cells stainable with a dye specific for dead 

15 cells (such as propidium iodide) in culture when the 

cells are 8 days post-differentiation (i.e., 8 days after 
the media is changed from 20% to 0.5% serum). 

Muscle regeneration as used herein refers to the 
process by which new muscle fibers form from muscle 

20 progenitor cells. The useful therapeutic for 

regeneration confers an increase in the number of new 
fibers by at least 1%, more preferably by at least 20%, 
and most preferably by at least 50%, as defined above. 

The differentiation of muscle cells as used herein 

25 refers to the induction of a muscle developmental program 
which specifies the components of the muscle fiber such 
as the contractile apparatus (the myofibril) . The 
therapeutic useful for differentiation increases the 
quantity of any component of the muscle fiber in the 

30 diseased tissue by at least 10% or more, more preferably 
by 50% or more, and most preferably by more than 100% 
relative to the equivalent tissue in a similarly treated 
control animal. 
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Atrophy of muscle as used herein refers to a 
significant loss in muscle fiber girth. By significant 
atrophy is meant a reduction of muscle fiber diameter in 
diseased, injured or unused muscle tissue of at least 10% 
5 relative to undiseased, uninjured, or normally utilized 
tissue. 

Methods for treatment of diseases or disorders 
using the polypeptides or other compounds described 
herein are also part of the invention. Examples of 

10 muscular disorders which may be treated include skeletal 
muscle diseases and disorders such as myopathies, 
dystrophies, myoneural conductive diseases, traumatic 
muscle injury, and nerve injury. Cardiac muscle 
pathologies such as cardiomyopathies, ischemic damage, 

15 congenital disease, and traumatic injury may also be 

treated using this methods of the invention, as may smooth 
muscle diseases and disorders such as arterial sclerosis, 
vascular lesions, and congenital vascular diseases. For 
example, Duchennes muscular dystrophy, Beckkers ' 

20 dystrophy, and Myasthenia gravis are but three of the 
diseases which may be treated using the methods of the 
invention. 

The invention also includes methods for the 
prophylaxis or treatment of a tumor of muscle cell origin 

25 such as rhabdomyosarcoma. These methods include 

administration of an effective amount of a substance 
which inhibits the binding of one or more of the 
polypeptides described herein and inhibiting the 
proliferation of the cells which contribute to the tumor. 

30 The methods of the invention may also be used to 

treat a patient suffering from a disease caused by a lack 
of a neurotrophic factor. By lacking a neurotrophic 
factor is meant a decreased amount of neurotrophic factor 
relative to an unaffected individual sufficient to cause 
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d tectable decrease in neuromuscular connections and/or 
muscular strength* The neurotrophic factor may be 
present at levels 10% below those observed in unaffected 
individuals* More preferably, the factor is present at 
5 levels 20% lover than are observed in unaffected 

individuals, and most preferably the levels are lowered 
by 80% relative to unaffected individuals under similar 
circumstances . 

The methods of the invention make use of the fact 

10 that the neuregulin proteins are encoded by the same 

gene. A variety of messenger RNA splicing variants (and 
their resultant proteins) are derived from this gene and 
many of these products show binding to PISS*** 82 and 
activation of the same. Products of this gene have been 

15 used to show muscle cell mitogenic activity (see Examples 
1 and 2, below), differentiation (Examples 3 and 6), and 
survival (Examples 4 and 5) • This invention provides a 
use for all of the known products of the neuregulin gene 
(described herein and in the references listed above) 

20 which have the stated activities as muscle cell mitogens, 
differentiation factors, and survival factors. Most 
preferably, recombinant human GGF2 (rhGGF2)is used in 
these methods. 

The invention also relates to the use of other, 

25 not yet naturally isolated, splicing variants of the 
neuregulin gene. Pig. 29 shows the known patterns of 
splicing. These patterns are derived from polymerase 
chain reaction experiments (on reverse transcribed RNA) , 
analysis of cONA clones (as presented within) , and 

30 analysis of published sequences encoding neuregulins 
(Peles et al., Cell £2:205 (1992) and Wen et al., Cell 
£2:559 (1992)). These patterns, as well as additional 
patterns disclosed herein, represent probable splicing 
variants which exist. The splicing variants are fully 
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described in Goodearl et al., USSN 08/036,555, filed 
March 24, 1993, incorporated herein by reference* 

More specifically, cell division, survival, 
differentiation and growth of muscle cells may be 
achieved by contacting muscle cells with a polypeptide 
defined by the formula 

WBAZCX 

wherein WBAZCX is composed of the polypeptide 
segments shown in Fig. 30 (SEQ ID Nos. 132, 134, 135, 
137-139, 156); wherein W comprises the polypeptide 
segment F, or is absent; wherein Z comprises the 
polypeptide segment 6 or is absent; and wherein X 
comprises the polypeptide segment C/D HKL, c/D H, C/D HL, 
C/D D, C/D' HL, C/D' HKL, C/D' H, C/D' D, C/D C/D' HKL, 
C/D C/D' H, C/D C/D' HL, C/D C/D' D, C/D D' H, C/D D' HL, 
C/D D' HKL, C/D' D' H, C/D' D' HL, C/D' D' HKL, C/D C/D' 
D' H, C/D C/D' D' HL, or C/D C/D' D' HKL and/or by 
contacting muscle cells with a polypeptide defined by the 
formula 

YBAZCX 

wherein YBAZCX is composed of the polypeptide 
segments shown in Fig. 30 (SEQ ID Nos. 133-135, 156, 
159) ; wherein Y comprises the polypeptide segment E, or 
is absent; wherein Z comprises the polypeptide segment G 
or is absent; and wherein X comprises the polypeptide 
segment C/D HKL, C/D H, C/D HL, C/D D, C/D' HL, C/D' HKL, 
C/D' H, C/D' D, C/D C/D' HKL, C/D C/D' H, C/D C/D' HL, 
C/D C/D' D, C/D D' H, C/D D' HL, C/D D' HKL, C/D' D' H, 
C/D' D' HL, C/D' D' HKL, C/D C/D' D' H, C/D C/D' D' HL, 
or C/D C/D' D' HKL. 

Generally, the N-terminus of the above-described 
polypeptides begins with either the F or E polypeptide 
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segments. When the F p lyp ptide is present it may be 
cleaved upon maturation of the protein to yield the 
mature polypeptide. When the E sequence is present the 
first 50 amino acids which represent the N- terminal 
5 signal sequence may be absent from the polypeptides. 

Furthermore, the invention includes a method of 
treating muscle cells by the application to the muscle 
cell of a 

-30 kD polypeptide factor isolated from the MDA-MB 
10 231 human breast cell line; or 

-35 kD polypeptide factor isolated from the rat I- 
EJ transformed fibroblast cell line to the glial cell or 

-75 kD polypeptide factor isolated from the SKBR-3 
human breast cell line; or 
15 -44 kD polypeptide factor isolated from the rat I- 

EJ transformed fibroblast cell line; or 

-25 kD polypeptide factor isolated from activated 
mouse peritoneal macrophages; or 

-45 kD polypeptide factor isolated from the MDA - 
20 MB 231 human breast cell; or 

-7 to 14 kD polypeptide factor isolated from the 
ATL-2 human T-cell line to the glial cell; or 

-25 kD polypeptide factor isolated from the bovine 
kidney cells; or 
25 -42 kD ARIA polypeptide factor isolated from 

brain; -46-47 kD polypeptide factor which stimulates 

0-2 A glial progenitor cells; or 

-43-45 kD polypeptide factor, GGFIII,175 
U.S. patent application Serial No. 07/931,041, filed 
30 August 17, 1992, incorporated herein by reference. 



The invention further includes methods for the use 
Of the EGFL1, E6FL2, EGFL3, E6FL4, EGFL5, and EGFL6 
polypeptides, Fig. 37 to 42 and SEQ ID Nos. 150 to 155, 
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r spectively, f r th treatment of muscl cells in vivo 
and in vitro - 

Also included in the invention is the 
administration of the GGF2 polypeptide whose sequence is 
5 shown in Fig. 44 for the treatment of muscle cells. 

An additional important aspect of the invention 
are methods for treating muscle cells using: 

(a) a basic polypeptide factor also known to have 
glial cell mitogenic activity , in the presence of fetal 
10 calf plasma, a molecular weight of from about 30 kD to 
about 36 kD, and including within its amino acid sequence 
any one or more of the following peptide sequences: 





F 


X 


G 


0 


A 


H 


T 


E 




(SEQ ID NO: 1) 




A 


S 


L 


A 


D 


E 


Y 


E 


Y M X K 


(SEQ ID NO: 2) 


15 


T 


E 


T 


S 


S 


S 


G 


L 


X L K 


(SEQ ID NO: 3) 




A 


S 


L 


A 


0 


E 


Y 


E 


Y M R K 


(SEQ ID NO: 7) 




A 


6 


Y 


F 


A 


E 


X 


A 


R 


(SEQ ID NO: 11) 




T 


T 


E 


M 


A 


S 


E 


Q 


G A 


(SEQ ID NO: 13) 




A 


K 


E 


A 


L 


A 


A 


L 


K 


(SEQ ID NO: 14) 


20 


P 


V 


L 


Q 


A 


K 


K 






(SEQ ID NO: 15) 




B 


T 


Q 


P 


D 


P 


G 


Q 


ILKKVPMVI 


G A Y T 






















(SEQ ID NO: 165) 




E 


Y 


K 


C 


L 


K 


F 


K 


WFKKATVN 


(SEQ ID NO: 17) 




E 


X 


K 


F 


Y 


V 


P 






(SEQ ID NO: 19) 


25 


K 


L 


E 


F 


L 


X 


A 


K 


(SEQ ID NO: 32); 


and 



(b) a basic polypeptide factor for use in 
treating muscle cells which is also known to stimulate 
glial cell mitogenesis in the presence of fetal calf 
plasma, has a molecular weight of from about 55 kD to 
30 about 63 kD, and including within its amino acid sequence 
any one or more of the following peptide sequences: 
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VHQVWAAK 
YIFFMEPEAXSSG 



(SEQ ID NO: 33) 
(SEQ ID NO: 34) 
(SEQ ID NO: 35) 
(SEQ ID NO: 36) 
(SEQ ID NO: 37) 
(SEQ ID NO: 38) 
(SEQ ID NO: 48) 
(SEQ ID NO: 49) 
(SEQ ID NO: 39) 



LGAWGPPAFPVXY 



WFVVIEGK 



5 



ASPVSVGSVQELQR 
VCLLTVAALPPT 
KVHQVWAAK 
KASLADSGEYMXK 



D L L L X V 



10 



Methods for the use of the peptide sequences set 



out above, derived from the smaller molecular weight 
polypeptide factor, and from the larger molecular weight 
polypeptide factor, are also aspects of this invention. 
Monoclonal antibodies to the above peptides are 

15 themselves useful investigative tools and therapeutics. 

Thus, the invention further embraces methods of 
using a polypeptide factor having activities useful for 
treating muscle cells and including an amino acid 
sequence encoded by: 

20 (a) a DNA sequence shown in any one of Figs. 27 A, 

27B or 27C, SEQ ID Nos. 129-131, respectively; 

(b) a DNA sequence shown in Fig. 21, SEQ ID No. 



25 281-557 of the sequence shorn in Fig. 27A, SEQ ID No. 
129; or 

(d) a DNA sequence hybridizable to any one of the 
DNA sequences according to (a) , (b) or (c) • 

Following factors as muscle cell mitogens: 
30 (a) a basic polypeptide factor which has, if 

obtained from bovine pituitary material, an observed 
molecular weight, whether in reducing conditions or not, 
of from about 30kD to about 36kD on SDS-polyacrylamide 



85; 



(c) the DNA sequence represented by nucleotides 
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gel electrophoresis which factor has muscle cell 
mitogenic activity including stimulating the division of 
myoblasts, and when isolated using reversed-phase HPLC 
retains at least 50% of said activity after 10 weeks 
5 incubation in 0.1% trifluoroacetic acid at 4°C; and 
(b) a basic polypeptide factor which has, if 
obtained from bovine pituitary material, an observed 
molecular weight, under non-reducing conditions, of from 
about 55 kO to about 63 Kd on SDS-polyacrylamide gel 

10 electrophoresis which factor the human equivalent of 
which is encoded by DNA clone 6GF2HBS5 and which factor 
has muscle cell mitogenic activity and when isolated 
using reversed-phase HPLC retains at least 50% of the 
activity after 4 days incubation in 0.1% trifluoroacetic 

15 acid at 4«C. 

Thus other important aspects of the invention are 
the use of: 

(a) A series of human and bovine polypeptide 
factors having cell mitogenic activity including 

20 stimulating the division of muscle cells. These peptide 
sequences are shown in Figs. 30, 31, 32 and 33, SEQ ID 
Nos. 132-133, respectively. 

(b) A series of polypeptide factors having cell 
mitogenic activity including stimulating the division of 

25 muscle cells and purified and characterized according to 
the procedures outlined by Lupu et al. Science 212: 1552 
(1990); Lupu et al. Proc. Natl. Acad. Sci USA ££: 2287 
(1992); Holmes et al. Science 2££: 1205 (1992); Peles et 
al. ££: 205 (1992); Yarden and Peles Biochemistry 2£: 

30 3543 (1991); Dobashi et al. Proc. Natl. Acad. Sci. fi£: 
8582 (1991); Davis et al. Biochem. Biophys. Res. Commun. 
179 ; 1536 (1991); Beaumont et al., patent application 
PCT/US91/03443 (1990); Bottenstein, U.S. Patent No. 
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5,276,145, issu d 1/4/94; and Green et al. patent 
application PCT/US91/02331 (1990). 

(c) A polypeptide factor (GGFBPP5) having glial 
cell mitogenic activity including stimulating the 
5 division of muscle cells. The amino acid sequence is 
shown in Fig. 31, SEQ ID No. 144. 

Methods for stimulating mitogenesis of a myoblast 
by contacting the myoblast cell with a polypeptide 
defined above as a muscle cell mitogen in vivo or in 
10 vitro are included as features of the invention. 

Muscle cell treatments may also be achieved by 
administering DNA encoding the polypeptide compounds 
described above in an expressible genetic construction. 
DNA encoding the polypeptide may be administered to the 
15 patient using techniques known in the art for delivering 
DNA to the cells. For example, retroviral vectors, 
electroporation or liposomes may be used to deliver DNA. 

The invention includes the use of the above named 
family of proteins as extracted from natural sources 
20 (tissues or cell lines) or as prepared by recombinant 
means. 

other compounds in particular, peptides, which 
bind specifically to the pl85* rbB2 receptor can also be 
used according to the invention as muscle cell mitogens. 

25 A candidate compound can be routinely screened for 

pl85* rbM binding, and, if it binds, can then be screened 
for glial cell mitogenic activity using the methods 
described herein. 

The invention includes use of any modifications or 

30 equivalents of the above polypeptide factors which do not 
exhibit a significantly reduced activity. For example, 
modifications in which amino acid content or sequence is 
altered without substantially adversely affecting 
activity are included. The statements of effect and use 
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contained herein are therefore to be construed 
accordingly, with such uses and effects employing 
modified or equivalent factors being part of the 
invention. 

5 The human peptide sequences described above and 

presented in Figs. 30, 31, 32 and 33, SEQ ID Nos. 132- 
146, respectively, represent a series of splicing 
variants which can be isolated as full length 
complementary DNAs (cDNAS) from natural sources (cDNA 
10 libraries prepared from the appropriate tissues) or can 
be assembled as DNA constructs with individual exons 
(e.g., derived as separate exons) by someone skilled in 
the art. 

The invention also includes a method of making a 
15 medicament for treating muscle cells, i.e., for inducing 
muscular mitogenesis, myogenesis, differentiation, or 
survival, by administering an effective amount of a 
polypeptide as defined above. Such a medicament is made 
by administering the polypeptide with a pharmaceutically 
20 effective carrier. 

Another aspect of the invention is the use of a 
pharmaceutical or veterinary formulation comprising any 
factor as defined above formulated for pharmaceutical or 
veterinary use, respectively, optionally together with an 
25 acceptable diluent, carrier or excipient and/or in unit 
dosage form. In using the factors of the invention, 
conventional pharmaceutical or veterinary practice may be 
employed to provide suitable formulations or 
compositions. 

30 Thus, the formulations to be used as a part of the 

invention can be applied to parenteral administration, 
for example, intravenous, subcutaneous, intramuscular, 
intraorbital, ophthalmic, intraventricular, intracranial, 
intracapsular , intraspinal , intracisternal , 
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intraperit neal, topical, intranasal, a r sol, 
scarification, and also oral, buccal, rectal or vaginal 
administration • 

The formulations of this invention may also be 
5 administered by the transplantation into the patient of 
host cells expressing the DNA encoding polypeptides which 
are effective for the methods of the invention or by the 
use of surgical implants which release the formulations 
of the invention. 

10 Parenteral formulations may be in the form of 

liquid solutions or suspensions; for oral administration, 
formulations may be in the form of tablets or capsules; 
and for intranasal formulations, in the form of powders, 
nasal drops, or aerosols. 

15 Methods well known in the art for making 

formulations are to be found in, for example, 
"Remington's Pharmaceutical Sciences. 11 Formulations for 
parenteral administration may, for example, contain as 
excipients sterile water or saline, polyalkylene glycols 

20 such as polyethylene glycol, oils of vegetable origin, or 
hydrogenated naphthalenes, biocompatible, biodegradable 
lactide polymer, or polyoxyethylene-polyoxypropylene 
copolymers may be used to control the release of the 
present factors* Other potentially useful parenteral 

25 delivery systems for the factors include ethylene-vinyl 
acetate copolymer particles, osmotic pumps, implantable 
infusion systems, and liposomes. Formulations for 
inhalation may contain as excipients, for example, 
lactose, or may be aqueous solutions containing, for 

30 example, polyoxyethylene-9-lauryl ether, glycocholate and 
deoxycholate, or may be oily solutions for administration 
in the form of nasal drops, or as a gel to be applied 
intranasally. Formulations for parenteral administration 
may also include glycocholate for buccal administration, 
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methoxysalicylate for rectal administration, or citric 
acid for vaginal administration. 

The present factors can be used as the sole active 
agents, or can be used in combination with other active 
5 ingredients, e.g., other growth factors which could 

facilitate neuronal survival in neurological diseases, or 
peptidase or protease inhibitors. 

The concentration of the present factors in the 
formulations of the invention will vary depending upon a 
10 number of issues, including the dosage to be 
administered, and the route of administration. 

In general terms, the factors of this invention 
may be provided in an aqueous physiological buffer 
solution containing about 0.1 to 10% w/v compound for 
15 parenteral administration. General dose ranges are from 
about 1 mg/kg to about 1 g/Jcg of body weight per day; a 
preferred dose range is from about 0.01 mg/kg to 100 
mg/kg of body weight per day. The preferred dosage to be 
administered is likely to depend upon the type and extent 
20 of progression of the pathophysiological condition being 
addressed, the overall health of the patient, the make up 
of the formulation, and the route of administration. 

The polypeptide factors utilized in the methods of 
the invention can also be used as immunogens for making 
25 antibodies, such as monoclonal antibodies, following 
standard techniques. These antibodies can, in turn, be 
used for therapeutic or diagnostic purposes. Thus, 
conditions perhaps associated with muscle diseases 
resulting from abnormal levels of the factor may be 
30 tracked by using such antibodies. In vitro techniques 
can be used, employing assays on isolated samples using 
standard methods. Imaging methods in which the 
antibodies are, for example, tagged with radioactive 
isotopes which can be imaged outside the body using 
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t chniques f r th art of tumor imaging may also be 
employed. 

A further general aspect of the invention is the 
use of a factor of the invention in the manufacture of a 
5 medicament, preferably for the treatment of a muscular 
disease or disorder. The "GGF2" designation is used for 
all clones which were previously isolated with peptide 
sequence data derived from GGF-IX protein (i.e., 
66P2HBS5, 66F2BPP3) and, when present alone (i.e., GGF2 

10 or rhG6F2), to indicate recombinant human protein encoded 
by plasmids isolated with peptide sequence data derived 
from the GGF-II protein (i.e., as produced in insect 
cells from the plasmid HBS5) . Recombinant human GGF from 
the G6FHBS5 clone is called GGF2, rhG6F2 and GGF2HBS5 

15 polypeptide. 

Treating as used herein means any administration 
of the compounds described herein for the purpose of 
increasing muscle cell mitogenesis, survival, and/or 
differentiation, and/or decreasing muscle atrophy and 

20 degeneration. Most preferably, the treating is for the 
purpose of reducing or diminishing the symptoms or 
progression of a disease or disorder of the muscle cells. 
Treating as used herein also means the administration of 
the compounds for increasing or altering the muscle cells 

25 in healthy individuals. The treating may be brought 
about by the contacing of the muscle cells which are 
sensitive or responsive to the compounds described herein 
with an effective amount of the compound, as described 
above. Inhibitors of the compounds described herein may 

30 also be used to halt or slow diseases of muscle cell 
proliferation • 
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Brief Pwcrlptlon Qt the Prayings 

The drawings will first be described. 
Drawings 

Fig. 1 is a graph shoving the results of rhGGF2 in 
5 a myoblast mitogenesis assay. 

Fig. 2 is a graph shoving the effect of rhGGF2 on 
the number of nuclei in my o tubes. 

Fig. 3 is a graph of a survival assay shoving the 
effect of rhGGF2 on survival of differentiated myotubes. 
10 Fig. 4 is a graph of survival assays shoving the 

effect of rhGGF2 on differentiated myotubes relative to 
human platelet derived growth factor, human fibroblast 
grovth factor, human epidermal growth factor, human 
leucocyte inhibitory factor, and human insulin-like 
15 grovth factors I and II. 

Fig. 5 is a graph shoving the increased survival 
on Duchenne muscular dystrophy cells in the presence of 
rh<SGF2. Fig. 6 is a graph of increasing human grovth 
hormone (hGH) expression in C2 cells from an hGH reporter 
20 gene under control of the AchR delta subunit 

transcriptional control elements. This increase is tied 
to the addition of GGF2 to the media. 

Fig. 7 is a graph of increasing hGH reporter 
synthesis and bungarotoxin (BTX) binding to AchRs 
25 following the addition of increasing amounts of GGF2 to 
C2 cells. 

Figs. 8, 9, 10 and 11 are the peptide sequences 
derived from GGF-I and GGF-II, SEQ ID Nob. 1-20, 22-29, 
32-50 and 165, (see Examples 11-13 hereinafter). 
30 Fig. 9, Panel A, is the sequences of GGF-I 

peptides used to design degenerate oligonucleotide probes 
and degenerate PCR primers are listed (SEQ ID Nos. 1, 17 
and 22-29) . Some of the sequences in Panel A vere also 
used to design synthetic peptides. Panel B is a listing 
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of the sequences of novel peptid s that were too short 
(less than 6 amino acids) for the design of degenerate 
probes or degenerate PCR primers (SEQ ID Nos. 17 and 32); 

Pig. 11, Panel A, is a listing of the sequences of 
5 GGF-II peptides used to design degenerate oligonucleotide 
probes and degenerate PCR primers (SEQ ID Nos. 42-49) . 
some of the sequences in Panel A were used to design 
synthetic peptides. Panel B is a listing of the novel 
peptide that was too short (less than 6 amino acids) for 

10 the design of degenerate probes or degenerate PCR primers 
(SEQ ID No. 50); 

Figs. 12, 13A, 13B, 14, 15, 16, 17, 18, and 19 
relate to Example 8, below, and depict the mitogenic 
activity of factors of the invention; 

15 Figs. 20, 21, 22, 23, 24, 25, 26, and 27 relate to 

Example 10* below and are briefly described below: 

Fig. 20 is a listing of the degenerate 
oligonucleotide probes (SEQ ID Nos. 51-84) designed from 
the novel peptide sequences in Figure 7, Panel A and 

20 Figure 9, Panel A; 

Fig. 21 (SEQ ID No. 85) depicts a stretch of the 
putative bovine GGF-II gene sequence from the recombinant 
bovine genomic phage GGF2BG1, containing the binding site 
of degenerate oligonucleotide probes 609 and 650 (see 

25 Figure 18, SEQ ID NOs. 66 and 69, respectively). The 
figure is the coding strand of the DNA sequence and the 
deduced amino acid sequence in the third reading frame. 
The sequence of peptide 12 from factor 2 (bold) is part 
of a 66 amino acid open reading frame (nucleotides 

30 75272); 

Fig. 22 is the degenerate PCR primers (Panel A, 
SEQ IS Nos. 86-104) and unique PCR primers (Panel B, SEQ 
ID Nos. 105-115) used in experiments to isolate segments 
•of the bovine GGF-II coding sequences present in RNA from 
35 posterior pituitary; 
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Fig. 23 depicts of the nine distinct contiguous 
bovine GGF-II cDNA structures and sequences that were 
obtained in PGR amplification experiments. The top line 
of the Figure is a schematic of the coding sequences 
5 which contribute to the cDNA structures that were 
characterized; 

Fig. 24 is a physical map of bovine recombinant 
phage of 6GF2BG1 • The bovine fragment is roughly 20 kb 
in length and contains two exons (bold) of the bovine 

10 GGF-II gene. Restriction sites for the enzymes Xbal, 
Spel, Ndel, EcoRI, Kpnl, and SstI have been placed on 
this physical map. Shaded portions correspond to 
fragments which were subcloned for sequencing; 

Fig. 25 is a schematic of the structure of three 

15 alternative gene products of the putative bovine GGF-II 
gene. Exons are listed A through E in the order of their 
discovery. The alternative splicing patterns 1, 2 and 3 
generate three overlapping deduced protein structures 
(GGF2BPP1, 2, and 3), which are displayed in the various 

20 Figs. 27A, 27B, 27C (described below); 

Fig. 26 (SEQ ID Nos. 116-128) is a comparison of 
the GGF-I and GGF-II sequences identified in the deduced 
protein sequences shown in Figs. 27A, 27B, 27C (described 
below) with the novel peptide sequences listed in Figs. 9 

25 and 11. The Figure shows that six of the nine novel 
GGF-II peptide sequences are accounted for in these 
deduced protein sequences. Two peptide sequences similar 
to GGF-I sequences are also found; 

Fig. 27 (SEQ ID No. 129) is a listing of the 

30 coding strand DNA sequence and deduced amino acid 
sequence of the cDNA obtained from splicing pattern 
number l in Figure 25. This partial cDNA of the putative 
bovine GGF-II gene encodes a protein of 206 amino acids 
in length. Peptides in bold were those identified from 
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th lists presented in Figs. 9 and 11. P tential 
glycosylation sites are underlined (along with 
polyadenylation signal AATAAA) ; 

Fig. 27 (SEQ ID No. 130) is a listing of the 
5 coding strand DNA sequence and deduced amino acid 
sequence of the cDNA obtained from splicing pattern 
number 2 in Fig. 25. This partial cDNA of the putative 
bovine GGF-IX gene encodes a protein of 281 amino acids 
in length. Peptides in bold are those identified from 

10 the lists presented in Figs. 7 and 9. Potential 
glycosylation sites are underlined (along with 
polyadenylation signal AATAAA) ; 

Fig. 27 (SEQ ID No. 131) is a listing of the 
coding strand DNA sequence and deduced amino acid 

15 sequence of the cDNA obtained from splicing pattern 

number 3 in Fig. 25. This partial cDNA of the putative 
bovine GGF-II gene encodes a protein of 257 aaino acids 
in length. Peptides in bold are those identified from 
the lists in Figs. 9 and 11. Potential glycosylation 

20 sites are underlined (along with polyadenylation signal 
AATAAA) . 

Fig. 28 , which relates to Example 16 hereinafter, 
is an autoradiogram of a cross hybridization analysis of 
putative bovine GGF-II gene sequences to a variety of 

25 mammalian DNAs on a southern blot. The filter contains 
lanes of EcoRI-digested DNA (5 jig per lane) from the 
species listed in the Figure. The probe detects a single 
strong band in each DNA sample, including a four kilobase 
fragment in the bovine DNA as anticipated by the physical 

30 map in Fig. 24. Bands of relatively minor intensity are 
observed as well, which could represent related DNA 
sequences. The strong hybridizing band from each of the 
other mammalian DNA samples presumably represents the 
GGF-II homologue of those species. 
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Fig. 29 is a diagram of representative splicing 
variants* The coding segments are represented by F, E, 
B, A, G, C, C/D, C/D', D, D', H, K and L. The location 
of the peptide sequences derived from purified protein 
5 are indicated by "o". 

Fig. 30 (SEQ ID NOS. 136-143, 156, 157, 169-178) 
is a listing of the DNA sequences and predicted peptide 
sequences of the coding segments of GGF. Line 1 is a 
listing of the predicted amino acid sequences of bovine 
10 GGF, line 2 is a listing of the nucleotide sequences of 
bovine GGF, line 3 is a listing of the nucleotide 
sequences of human GGF (heregulin) (nucleotide base 
matches are indicated with a vertical line) and line 4 is 
a listing of the predicted amino acid sequences of human 
15 GGF/heregulin where it differs from the predicted bovine 
sequence. Coding segments E, A' and K represent only the 
bovine sequences. Coding segment D' represents only the 
human (heregulin) sequence. 

Fig. 31 (SEQ ID No. 144) is the predicted GGF2 
20 amino acid sequence and nucleotide sequence of BPP5. The 
upper line is the nucleotide sequence and the lover line 
is the predicted amino acid sequence. 

Fig. 32 (SEQ ID No. 145) is the predicted amino 
acid sequence and nucleotide sequence of GGF2BPP2 • The 
25 upper line is the nucleotide sequence and the lover line 
is the predicted amino acid sequence . 

Fig. 33 (SEQ ID No. 146) is the predicted amino 
acid sequence and nucleotide sequence of GGF2BPP4 • The 
upper line is the nucleotide sequence and the lover line 
30 is the predicted amino acid sequence. 

Fig. 34 (SEQ ID Nos. 147-149) depicts the 
alignment of two GGF peptide sequences (GGF2BPP4 and 
GGF2BPP5) vith the human EGF (hEGF) . Asterisks indicate 
positions of conserved cysteines. 
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Pig. 35 depicts the 1 v 1 of GGF activity (Schwann 
cell mitogenic assay) and tyrosine phosphorylation of a 
ca. 200k0 protein (intensity of a 200 kO band on an 
autoradiogram of a Western blot developed with an 
5 antiphosphotyrosine polyclonal antibody) in response to 
increasing amounts of GGF. 

Fig. 36 is a list of splicing variants derived 
from the sequences shown in Fig. 30. 

Fig. 37 is the predicted amino acid sequence , 
10 bottom, and nucleic sequence, top, of E6FL1 (SEQ ID No. 

150) . 

Fig. 38 is the predicted amino acid sequence, 
bottom, and nucleic sequence, top, of E6FL2 (SEQ ID No. 

151) . 

15 Fig. 39 is the predicted amino acid sequence, 

bottom, and nucleic sequence, top, of E6FL3 (SEQ ID No. 

152) . 

Fig. 40 is the predicted amino acid sequence, 
bottom, and nucleic sequence, top, of EGFL4 (SEQ ID No. 
20 153). 

Fig. 41 is the predicted amino acid sequence, 
bottom, and nucleic sequence, top, of EGFL5 (SEQ ID No. 
154) . 

Fig. 42 is the predicted amino acid sequence, 
25 bottom, and nucleic sequence, top, of EGFI#6 (SEQ ID No. 
159) . 

Fig* 43 is a scale coding segment map of the 
clone. T3 refers to the bacteriophage promoter used to 
produce mRNA from the clone. R « flanking EcoRI 
30 restriction enzyme sites. 5' UT refers to the 5' 

untranslated region. E, B, A, C, C/D' , and D refer to 
the coding segments. 0 - the translation start site. A 
■ the 5' limit of the region homologous to the bovine E 
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s gment (see Example 17) and 3' UT refers t th 3' 
untranslated region. 

Fig. 44 Is the predicted amino acid sequence 
(middle) and nucleic sequence (top) of GGF2HBS5 (SEQ ZD 
5 No. 21). The bottom (intermittent) sequence represents 
peptide sequences derived from GGF-II preparations (see 
Figs. 8, 9). 

Fig. 45 (A) is a graph shoving the purification of 
rGGF on cation exchange column by fraction; Fig. 45 (B) 
10 is a photograph of a Western blot using fractions as 
depicted in (A) and a GGFII specific antibody. 

Fig. 46 is the sequence of the GGFHBS5, GGFHFB1 
and GGFBPP5 polypeptides (SEQ ID NOS: 166, 167, and 168). 

Fig. 47 is a map of the plasmld pcDHRFpolyA • 

15 Detailed Description 

The invention pertains to the use of isolated and 
purified neuregulin factors and DNA sequences encoding 
these factors, regulatory compounds which increase the 
extramuscular concentrations of these factors, and 

20 compounds which are mimetics of these factors for the 
induction of muscle cell mitogenesis, differentiation, 
and survival of the muscle cells in vivo and in vitro. 

It is evident that the gene encoding GGF/pl85 #rbB2 
binding neuregulin proteins produces a number of 

25 variably-sized, differentially-spliced RNA transcripts 
that give rise to a series of proteins. These proteins 
are of different lengths and contain some common peptide 
sequences and some unique peptide sequences. The 
conclusion that these factors are encoded by a single 

30 gene is supported by the differentially-spliced RNA 
sequences which are recoverable from bovine posterior 
pituitary and human breast cancer cells (MDA-MB-231) ) * 
Further support for this conclusion derives from the size 
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rang of proteins which act as both mitogens for muscle 
tissue (as disclosed herein) and as ligands for the 
pl85 #rbB2 receptor (see below) . 

Further evidence to support the fact that the 
5 genes encoding GGF/pl85* rbB2 binding proteins are 

homologous comes from nucleotide sequence comparison* 
Holmes et al., (Science 256 :1205-1210. 1992) demonstrate 
the purification of a 45-kilodalton human protein 
(Heregulin-cr) which specifically interacts with the 

10 receptor protein pl85 #rbB2 . Peles et al. (Cell £2:205 
(1992)) and Wen et al. (Cell £2:559 (1992)) describe a 
complementary DNA isolated from rat cells encoding a 
protein called "neu differentiation factor 19 (NDF) ♦ The 
translation product of the NDF cDNA has pies* 1 * 82 binding 

15 activity. Several other groups have reported the 

purification of proteins of various molecular weights 
with plSS" 1 * 82 binding activity. These groups include 
Lupu et al. ((1992) Proc. Natl. Acad. Sci. USA 89:2287); 
Yarden and Peles ((1991) Biochemistry 30:3543); Lupu et 

20 al. ((1990) Science 249:1552)); Dobashi et al. ((1991) 
Biochem. Biophys. Res. Comm. 179:1536); and Huang et al. 
((1992) J. Biol. Chem. 257:11508*11512). 

We have found that pl85 #rbB2 receptor binding 
proteins stimulate muscle cell mitogenesis and hence, 

25 stimulates myotube formation (myogenesis) • This 

stimulation results in increased formation of myoblasts 
and increased formation of myotubes (myogenesis) • The 
compounds described herein also stimulate increased 
muscle growth, differentiation, and survival of muscle 

30 cells. These ligands include, but are not limited to the 
GGF's, the neuregulins, the heregulins, NDF, and ARIA. 
As a result of this mitogenic activity, these proteins, 
DNA encoding these proteins, and related compounds may be 
administered to patients suffering from traumatic damage 
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r diseases f th muscle tissu . It is understood that 
all methods provided for the purpose of mitogenesis are 
useful for the purpose of myogenesis. Inhibitors of 
these ligands (such as antibodies or peptide fragments) 
5 may be administered for the treatment of muscle derived 
tumors. 

These compounds may be obtained using the 
protocols described herein (Examples 9-17) and in Holmes 
et al., Science 2£&: 1205 (1992); Peles et al. , Cell 

10 £2:205 (1992); Wen et al., Cell £2:559 (1992); Lupu et 
al., Proc. Natl. Acad. Sci. OSA 89:2287 (1992); Yarden 
and Peles, Biochemistry 30:3543 (1991); Lupu et al., 
Science 249:1552 (1990); Dobashi et al., Biochem. 
Biophys. Res. Comm. 179:1536 (1991); Huang et al*, J . 

15 Biol. Chem. 257:11508*11512 (1992); Marchionni et al., 

Nature 362:313, (1993); and in the GGF-III patent, all of 
which are incorporated herein by reference. The 
sequences are provided and the characteristics described 
for many of these compounds. For sequences see Figs. 8* 

20 11, 20-27C, 29-34, 36-44, and 46. For protein 

characteristics see Figs. 12-19, 28 35, 45A and 45B. 

Compounds may be assayed for their usefulness In 
vitro using the methods provided in the examples below. 
In vivo testing may be performed as described in Example 

25 l and in Sklar et al., In Vitro Cellular and 
Developmental Biology 27A: 433-434, 1991. 

The invention includes methods for the use of any 
protein which is substantially homologous to the coding 
30 segments in Fig. 30 (SEQ ID Nos. 132-143, 156, 1576-147, 
160, and 161) as well as other naturally occurring GGF 
polypeptides for the purpose of inducing muscle 
mitogenesis. Also included are the use of: allelic 
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variations; natural mutants; indue d mutants; pr teins 
encoded by DNA that hybridizes tinder high or low 
stringency conditions to a nucleic acid naturally 
occurring (for definitions of high and low stringency see 
5 Current Protocols in Molecular Biology, John Wiley & 

Sons, New York, 1989, 6.3.1 - 6.3.6, hereby incorporated 
by reference) ; and the use of polypeptides or proteins 
specifically bound by antisera to GGF polypeptides. The 
term also includes the use of chimeric polypeptides that 

10 include the GGF polypeptides comprising sequences from 
Fig. 28 for the induction of muscle mitogenesis. 

As will be seen from Example 8, below, the present 
factors exhibit mitogenic activity on a range of cell 
types. The general statements of invention above in 

15 relation to formulations and/or medicaments and their 
manufacture should clearly be construed to include 
appropriate products and uses. 

A series of experiments follow which provide 
additional basis for the claims described herein. The 

20 following examples relating to the present invention 
should not be construed as specifically limiting the 
invention, or such variations of the invention, now known 
or later developed. 

The examples illustrate our discovery that 

25 recombinant human GGF2 (rhGGF2) confers several effects 
on primary human muscle culture. rhGGF2 has significant 
effects in three independent biological activity assays 
on muscle cultures. The polypeptide increased 
mitogenesis as measured by proliferation of subconf luent 

30 quiescent myoblasts, increased differentiation by 

confluent myoblasts in the presence of growth factor, and 
increased survival of differentiated myotubes as measured 
by loss of dye exclusion and increased acetylcholine 
receptor synthesis. These activities indicate efficacy 
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of 6GF2 and other neuregulins in inducing muscle repair, 
regeneration, and prophylactic effects on muscle 
degeneration. 

EXAMPLE 1 

5 Mltoqenic Activity of rhccr o n Myoblasts 

Clone GGF2HBS5 was expressed in recombinant 
Baculovirus infected insect cells as described in 
Example 14, infra, and the resultant recombinant human 
66F2 was added to myoblasts in culture (conditioned 

10 medium added at 40 m1/»1) . Myoblasts (057A cells) were 
grown to preconfluence in a 24 well dish. Medium was 
removed and replaced with DMEM containing 0.5% fetal calf 
serum with or without 66F2 conditioned medium at a 
concentration of 40 pl/ml. Medium was changed after 2 

15 days and cells were fixed and stained after 5 days. 

Total nuclei were counted as were the number of nuclei in 
myoblasts (Table 1) . 



TABLB 1 







Total Number 


Nuclei in 


Fusion 




Treatment 


of Nuclei /mm 2 


Myotubes 


Index 


20 


Control 


395 ± 28.3 


204 ± 9.19 


0.515 ± 0.01 




GGF 40Ml/ml 


636 ± 8.5 


381 ± 82.7 


0.591 1 0.15 



GGF treated myoblasts showed an increased number of total 
nuclei (636 nuclei) over untreated controls (395 nuclei) 
indicating mitogenic activity. rhGGF2 treated myotubes 
25 had a greater number of nuclei (381 nuclei) than 

untreated controls (204 nuclei). Thus, rhGGF2 enhances 
the total number of nuclei through proliferation and 
increased cell survival. rhGGF2 is also likely to 
enhance the formation of myotubes. 
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The mitog nic activity of rhGGF2 may be measured 
in vivo by giving a continuous supply of GGF2 and 
[ 3 H] thymidine to rat muscle via an osmotic mini pump. 
The muscle bulk is determined by vet weight after one and 
5 two weeks of treatment. DNA replication is measured by 
counting labeled nuclei in sections after coating for 
autoradiography (Sklar et al., In Vitro Cellular and 
Developmental Biology 27A: 433-434, 1991) in sham and 
rhGGF2 -treated muscle. Denervated muscle is also 
10 examined in this rat animal model via these methods and 
this method allows the assessment of the role of rhGGF2 
in muscle atrophy and repair. Mean fiber diameter can 
also be used for assessing effects of F6F on prevention 
of atrophy. 



15 EXAMPLE 2 

Etfret Qt rhggFZ on Muscle Cell Mitranegig 

Quiescent primary clonal human myoblasts were 
prepared as previously described (Sklar, R. , Hudson, A. , 
Brown, R. , In vitro Cellular and Developmental Biology 

20 1991; 27A:433-434) . The quiescent cells were treated 

with the indicated agents (rhGGF2 conditioned media, PDGF 
with and without methylprednisolone, and control media) 
in the presence of 10/iM BrdU, 0.5% FCS in DMEM. After 
two days the cells were fixed in 4% paraformaldehyde in 

25 PBS for 30 minutes, and washed with 70% ethanol. The 
cells were then incubated with an anti-BrdU antibody, 
washed, and antibody binding was visualized with a 
peroxidase reaction. The number of staining nuclei were 
then quantified per area. The results show that GGF2 

30 induces an increase in the number of labelled nuclei per 
area over controls (see Table 2). 
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TABUS 2 



Mitogenic Effects of GGF on Human Myoblasts 



Treatment 


Labelled 
Nuclei/cm 2 


T— Test 
p value 


Control 


120 ± 22.4 




Infected Control 


103 ± 11.9 




GGF 5 m1/b1 


223 ± 33.8 


0.019 


PDGF 20 ng/ml 


418 ± 45.8 


0.0005 


ZGFI 30 ng/ml 


280 ± 109.6 


0.068 


Methylprednisolone 1.0 pM 


142 ± 20.7 


0.293 



10 Platelet derived growth factor (PDGF) was used as a 

positive control, Methylprednisolone (a corticosteroid) 
vas also used in addition to rhG6F2 and shoved no 
significant increase in labelling of DNA. 

rhGGF2 purified to homogeneity (>95% pure) is also 

15 mitogenic for human myoblasts (Fig. 1) • 

Recombinant human GGF2 also causes mitogenesis of 
primary human myoblasts (see Table 2 and Fig. 1) ♦ The 
mitogenesis assay is performed as described above. The 
mitotic index is then calculated by dividing the number 

20 of BrdU positive cells by the total number of cells. 

EXAMPLE 3 

Effect of rhGGF2 on Muscle Cell Differentiation 

The effects of purified rh6GF2 (95% pure) on 
muscle culture differentiation were examined (Fig. 2) . 
25 Confluent myoblast cultures were induced to differentiate 
by lowering the serum content of the culture medium from 
20% to 0.5%. The test cultures were treated with the 
indicated concentration of rhGGF2 for six days, 
refreshing the culture medium every 2 days. The cultures 
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wer then fixed, stained, and the number of nucl i 
counted per millimeter. The data in Fig. 2 demonstrate a 
large increase in the number of nuclei in myotubes when 
rhGGF2 is present, relative to controls. 

5 EXAMPLE 4 

g*f«ct of rhGGF2 on the Survival of Differentiated 

Mvotubes 

The survival of differentiated myotubes was 
significantly increased by rh66F2 treatment. Muscle 

10 cultures were differentiated in the presence of rhGGF2 
and at various times the number of dead myotubes were 
counted by propidium iodide staining. As can be seen in 
Fig. 3, the number of dead myotubes is lower in the 
rhG6F2 treated culture at 4, 5, 6, and 8 days of 

15 differentiation. The number of nuclei in myotubes was 
significantly increased by GGF2 treatment compared to 
untreated cultures after 8 days of differentiation. 
Specifically, the control shoved 8.6 myonuclei/mm 2 , while 
rhGGF2 treated cultures showed 57.2 myonuclei/mm 2 

20 (p-0.035) when counted on the same plates after geimsa 
staining. 

The survival assay was also performed with other 
growth factors which have known effects on muscle 
culture. The rhGGF2 effect was unique among the growth 

25 factors tested (Fig. 4) . In this experiment cultures 
were treated in parallel with the rhGGF2 treated plates 
with the indicated concentrations of the various growth 
factors. Survival of myotubes was measured as above at 8 
days of differentiation of 057A myoblast cells. 

30 concentrations of factors were as follows: rhGGF2: 

lOOng/ml; human platelet derived growth factor: 20ng/ml; 
human basic fibroblast growth factor: 25ng/ml; human 
epidermal growth factor: 30ng/ml; human leucocyte 
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inhibitory factor: lOng/ml; human insulin like growth 
factor I: 30ng/ml; human insulin like growth factor II: 
25ng/ml. 

The observed protection of differentiated myotubes 
5 from death indicates that rh66F2 has promise as a therapy 
for intervention of muscle degeneration characterized by 
numerous muscle diseases. Thus, agents which increase 
the extramuscular concentration of neuregulins may have a 
prophylactic effect or slow the progress of muscle- 
10 wasting disorders and increase rates of muscle 

differentiation, repair, conditioning, and regeneration. 

rhSCF2 Promotes Survival of Differentiated Mvotuhgg with 
a ggnet i g Defect at the Duchenne Muscular Dys t rophy Locus 

15 The positive effects of rhGGF2 on myotube survival 

could reflect potential efficacy in degenerative 
disorders. These effects on myotube survival were tested 
on a clonally-derived primary Duchenne myoblast to 
determine if the response observed in normal muscle 

20 culture could also be demonstrated in cultures derived 
from diseased individuals. The data presented in Pig. 5 
was obtained using the same muscle culture conditions 
(Example 4, above) used for normal individual. rhGGF2 
significantly decreased the number of dead myotubes in 

25 the differentiated Duchenne muscle culture, compared to 
controls (p-0.032). Concentrations were as follows: 
GGF2: lOOng/ml; human platelet derived growth factor: 
20ng/ml; human insulin like growth factor I: 30ng/ml. 

This example demonstrates that rhGGF2 can also 

30 promote survival of differentiated Duchenne myotubes and 
provides strong evidence that rhGGF2 may slow or prevent 
the course of muscle degeneration and wasting in mammals. 
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EXAMPLE g 

rhGGF2 Effect on the Differentiation Program; Induction 
of MHC Slow and Dystrophin Proteins 

The effects of purified rhGGF2 on muscle culture 
5 differentiation was also examined by Western analysis of 
culture ly sates. The levels of muscle specific proteins 
were determined in triplicate treated and untreated 
cultures. These cultures were prepared and treated as 
above except that the plate size was increased to 150 mm 

10 and the muscle culture layer was scraped off for Western 
analysis as described in Sklar, R. , and Brown, R. (J . 
N&urol. Sci. 101:73-81, 1991). The results presented in 
Table A indicate that rhGGF2 treatment increases the 
levels of several muscle specific proteins, including 

15 dystrophin, myosin heavy chain (MHC, adult slow and fast 
isof orms) , but does not increase the levels of HSP72 or 
MHC neonate isoform to a similar level per amount of 
protein loaded on the Western. The levels of muscle 
specific proteins induced by rhGGF2 were similar to the 

20 quantitative increases in the number of myonuclei/mm 2 
(Table 3) . 
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TABLE 3 

Control ±SD rhGGF2 Treat- p 

value 

Bent ±SD 



Total Protein (pg) 


554 1 38.4 


798 ± 73.6 


0.007 


Hyonuclei/DB 2 


29.0 ± 12.2 


106 ±24.1 


0.008 


MHC fast/M9 protein 


1.22 ± 0.47 


4.00 ± 0.40 


0.001 


MHC slow/pg protein 


0.17 ± 0.13 


1.66 ± 0.27 


0.001 


MHC neonate//!? protein 


0.30 ± 0.27 


0.55 i 0.04 


0.199 


dystrophin/ jig protein 


6.67 ± 0.37 


25.5 ± 11.0 


0.042 


RSP 12/ nq protein 


3.30 ± 0.42 


4.54 ± 0.08 


0.008 



The rhGGF2 dependent increase in the adult myosin 
heavy chain isoforms (slow is found in type I human 
muscle fibers; fast is found in type 2 A and 2B human 

15 muscle fibers) may represent a maturation of the 

myotubes, as the neonatal isoform was not significantly 
increased by rhGGF2 treatment. During rat muscle 
development MHC isoforms switch from fetal to neonatal 
forms followed by a switch to mature adult slow and fast 

20 MHC isoforms (Periasamy et al. J. Biol. Chom. 259:13573- 
13578, 1984; Periasamy et al. J • Biol. Chmm. 260:15856- 
15862, 1985; Wieczorek et al. J. Cell Biol. 101:618-629, 
1985) . While muscle can autonomously undergo some of 
these isoform transitions in the absence of neural cells 

25 or tissue, mouse muscle explants express the adult fast 
MHC isoform only when cultured in the presence of mouse 
spinal cord (Ecob-Prince et al. J . Cell Biol. 103:995- 
1005, 1986) . Additional evidence that MHC isoform 
transitions are influenced by nerve was established by 

30 Whalen et al. (Deve. Biol. 141:24-40, 1990); after 

regeneration of notexin treated rat soleus muscles only 



WO 94/26298 



PCT7US94/05083 



- 37 - 

th adult fast MHC i so form was produced in th new 
denervated muscle, but innervated regenerated muscle made 
both fast and slow adult MHC isoforms. Thus the 
demonstration in Table 3 that rhGGF2 increases the 
5 synthesis of adult MHC isoforms indicates that rhG6P2 may 
induce a developmental maturation of muscle which may 
mimic neuronal innervation. 

EXAMPLE 7 

Neureoullns. including rhQGF2 . induce the synthesis of 

10 acetylcholine receptors in muscle. 

The expression of acetylcholine receptor (AchR) 
subunit proteins can be induced by exposing muscle cells 
to neuregulins. More specifically, we have shorn that 
contacting muscle cells with rhG6F2 can induce the 

15 synthesis of AchR subunit proteins. This induction 
following rhG6F2 exposure was observed in two ways: 
first, we detected increased expression of human growth 
hormone via the product of a reporter gene construct and 
second we detected increased binding of alpha* 

20 bungarotoxin to cells. 

In the following example a mouse myoblast cell 
line C2 was used. C2 cells were transfected with a 
transgene that contained the 5' regulatory sequences of 
the AChR delta subunit gene of mouse linked to a human 

25 growth hormone full-length cDNA (Baldwin and Burden, 
1988. J. Cell Biol. 107:2271-2279). This reporter 
construct allows the measurement of the induction of AChR 
delta gene expression by assaying the quantity of growth 
hormone secreted into the media. The line can be induced 

30 to form myotubes by lowering serum concentration in the 
media from 20% to 0.5%. 

Specifically, mouse C2 myoblasts transfected with 
an AChR-human growth hormone reporter construct and were 
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assayed for expr ssi n f hGH foil wing treatment with 
rhGGF2. The results of two separate experiments are 
summarized in Table 4 and in Figures 6 (hGH expression) 
and 7 (hGH expression and alpha-bungarotoxin binding) . 
5 Shown are the dose response curves for secreted human 
growth hormone and for bungarotoxin binding from muscle 
cultures treated with rhGGF2 . 



TABLE 4 

Effects of rhGGF2 on the expression of AChR delta 
10 subunit/hGH transgene and the synthesis of AChR 

E202-1 EXEL-2 

GGF hGH hGH AChR 

15 (ul) (ng/ml) (ng/ml) (cpm/mg protein) 



0 9,3 + 2.1 5,7 + 2.1 822 + 170 

0.1 - 6.8 + 1.5 891 + 134 

0.5 - 12.0 + 0.9 993 + 35 

1.0 - 9.7 + 2.3 818 + 67 

20 5.0 17,5 + 2.8 14.7 + 3.5 1300 + 177 

10.0 14.3 + 3.2 14.1 + 3.3 1388 + 137 

15.0 22.0 + 1.4 



25 



C2 myotubes were treated with cold a-BTX (20 nN) 
for 1 hr. at 37 °C, washed with culture medium twice and 
then treated with GGF2. Culture medium was adjusted with 
30 bovine serum albumin at the concentration of 1 mg/ml. 24 
hours later, culture medium was removed and saved for. hGH 
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assay. Muscle cultures wer treated with 125 I-a-BTX (20 
nM) for l hour at 37 «c, washed and scraped in PBS 
containing 1% SDS. Non-specific binding was determined 
in the presence of cold o-BTX (40 nM) . The cell 
5 homogenate was counted for radioactivity and assayed for 
total protein amount. 

The presence of rhG6P2 led to a greater than 2- 
fold increase in hGH gene expression, thereby indicating 
that rhG6P2 induced the synthesis of the delta subunit of 
10 the acetylcholine receptor. Furthermore, increased 

bungarotoxin binding is consistant with assembly of these 
subunit proteins into functional acetylcholine receptors. 
To strenthen the interpretation of these data the 
analysis was repeated on cultures that had the hGH 
15 reporter linked to a metallothiene promoter, which should 
not be responsive to rhGGF2 . The results of that control 
experiment showed that the hGH response was mediated 
through transcriptional activation of the AchR delta 
subunit gene control elements. 
20 These results indicate that rhGGF2 could be useful 

in replenishing AchRs as part of the therapy for the 
autoimmune disease Myasthenia gravis. This activity may 
also be beneficial in treatment of peripheral nerve 
regeneration and neuropathy by stimulating a key step in 
25 re- innervation of muscle. 

EXAMPLE a 

Additional Hitoggnlc Activities of Purifie d ggf-i and 
ggr-Il 

The mitogenic activity of a highly purified sample 
30 containing both GGFs I and II was studied using a 

quantitative method, which allows a single microculture 
to be examined for DMA synthesis, cell morphology, cell 
number and expression of cell antigens. This technique 
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has been modified from a method previously reported by 
Nuir et al., Analytical Biochemistry 185, 377*382, 1990. 
The main modifications are: 1) the use of uncoated 
microtiter plates, 2) the cell number per veil, 3) the 
5 use of 5% Foetal Bovine Plasma (FBP) instead of 10% 

Foetal Calf Serum (FCS) , and 4) the time of incubation in 
presence of mitogens and bromodeoxyuridine (BrdU) , added 
simultaneously to the cultures, in addition the cell 
monolayer was not washed before fixation to avoid loss of 

10 cells, and the incubation time of monoclonal mouse 
anti-BrdU antibody and peroxidase conjugated goat 
ant i -mouse immunoglobulin (XgG) antibody were doubled to 
increase the sensitivity of the assay. The assay, 
optimized for rat sciatic nerve Schwann cells, has also 

15 been used for several cell lines, after appropriate 
modifications to the cell culture conditions. 

I. Methods of Mitoaenesis Testing 

On day 1, purified Schwann cells were plated onto 
uncoated 96 well plates in 5% FBP/Dulbecco's Modified 

20 Eagle Medium (DMEM) (5,000 cells/well). On day 2, GGFs 
or other test factors were added to the cultures, as well 
as BrdU at a final concentration of 10 /im. After 48 
hours (day 4) BrdU incorporation was terminated by 
aspirating the medium and cells were fixed with 200 

25 Ml/vell of 70% ethanol for 20 min at room temperature. 
Next, the cells were washed with water and the DNA 
denatured by incubation with 100 Ml 2N HC1 for 10 min at 
37 °c. Following aspiration, residual acid was 
neutralized by filling the wells with 0.1 M borate 

30 buffer, pH 9.0, and the cells were washed with phosphate 
buffered saline (PBS) . Cells were then treated with 50 
Ml of blocking buffer (PBS containing 0.1% Triton X 100 
and 2% normal goat serum) for 15 min at 37 °c. After 
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aspirati n, in n cl nal m use anti-BrdU antibody (Dako 
Corp., Santa Barbara, CA) (50 >il/well, l.* Mg/»1 diluted 
in blocking buffer) was added and incubated for two hours 
at 37 °C. Unbound antibodies vere removed by three washes 
5 in PBS containing 0.1% Triton X-100 and 

peroxidase-conjugated goat anti-mouse IgG antibody (Dako 
Corp., Santa Barbara, CA) (50 Ml/well, 2 /ig/ml diluted in 
blocking buffer) was added and incubated for one hour at 
37 # C. After three washes in PBS/Triton and a final rinse 

10 in PBS, wells received 100 pl/well of 50 mM 

phosphate/citrate buffer, pH 5.0, containing 0.05% of the 
soluble chromogen o-phenylenediamine (OPD) and 0.02% H 2 0 2 . 
The reaction was terminated after 5-20 min at room 
temperature, by pipetting 80 jil from each well to a clean 

15 plate containing 40 Ml/well of 2N sulfuric acid. The 
absorbance was recorded at 490nm using a plate reader 
(Dynatech Labs) . The assay plates containing the cell 
monolayers were washed twice with PBS and 
immunocytochemically stained for BrdU-DNA by adding 100 

20 Ml/well of the substrate diaminobenzidine (DAB) and 0.02% 
H 2 0 2 to generate an insoluble product. After 10-20 min 
the staining reaction was stopped by washing with water, 
and BrdU-positive nuclei observed and counted using an 
inverted microscope, occasionally, negative nuclei were 

25 counterstained with 0.001% Toluidine blue and counted as 
before. 

IL — Cell lines used for Mitooenesis Assays 

Swiss 3T3 Fibroblasts: Cells, from Flow Labs, 
were maintained in DMEM supplemented with 10% PCS, 
30 penicillin and streptomycin, at 37 # C in a humidified 
atmosphere of 10% C0 2 in air. Cells were fed or 
subcultured every two days. For mitogenic assay, cells 
were plated at a density of 5,000 cells/well in complete 
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medium and incubated for a week until cells were 
confluent and quiescent. The serum containing medium was 
removed and the cell monolayer washed twice with serum 
free-medium. 100 Ml of serum free medium containing 
5 mitogens and 10/iM of BrdU were added to each well and 

incubated for 48 hours. Dose responses to GGFs and serum 
or PDGF (as a positive control) were performed. 

BEX (Baby Hamster Kidney) 21 C13 Fibroblasts: 
Cells from European collection of Animal cell Cultures 

10 (ECACC) , were maintained in Glasgow Modified Eagle Medium 
(GMEM) supplemented with 5% tryptose phosphate broth, 5% 
FCS, penicillin and streptomycin, at 37°c in a humidified 
atmosphere of 5% C0 2 in air. Cells were fed or 
subcultured every two to three days. For mitogenic 

15 assay, cells were plated at a density of 2,000 cell/well 
in complete medium for 24 hours. The serum containing 
medium was then removed and after washing with serum free 
medium, replaced with 100 Ml of 0.1% FCS containing GMEM 
or GMEM alone. GGFs and FCS or bFGF as positive controls 

20 were added, coincident with lOjiM BrdU, and incubated for 
48 hours. Cell cultures were then processed as described 
for Schwann cells. 

C6 Rat Glioma Cell Line: Cells, obtained at 
passage 39, were maintained in DMEM containing 5% FCS, 5% 

25 Horse serum (HS) , penicillin and streptomycin, at 37 °C in 
a humidified atmosphere of 10% C0 2 in air. Cells were 
fed or subcultured every three days. For mitogenic 
assay, cells were plated at a density of 2,000 cells/well 
in complete medium and incubated for 24 hours. Then 

30 medium was replaced with a mixture of 1:1 DMEM and F12 
medium containing 0.1% FCS, after washing in serum free 
medium. Dose responses to GGFs, FCS and aFGF were then 
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performed and cells were proc ssed through the ELISA as 
previously described for the other cell types. 

PC12 (Rat Adrenal Pheochromocytoma Cells) : Cells 
from ECACC, were maintained in RPKI 1640 supplemented 
5 with 10% HS, 5% PCS, penicillin and streptomycin, in 
collagen coated flasks, at 37 *C in a humidified 
atmosphere of 5% C0 2 in air* Cells were fed every three 
days by replacing 80% of the medium. For mitogenic 
assay, cells vere plated at a density of 3,000 cells/well 

10 in complete medium, on collagen coated plates (50 Ml/well 
collagen, Vitrogen Collagen Corp., diluted 1 : 50, 30 min 
at 37*C) and incubated for 24 hours. The medium was then 
placed with fresh RPMI either alone or containing 1 mN 
insulin or 1% PCS. Dose responses to PCS/HS (1:2) as 

15 positive control and to GGPs vere performed as before. 
After 48 hours cells vere fixed and the ELISA performed 
as previously described. 

ttt. Resul t* of Mitoqftnesis Assays: All the experiments 
presented in this Example vere performed using a highly 
20 purified sample from a Sepharose 12 chromatography 
purification step containing a mixture of GGF-I and 

GGP-II (GGPs). 

First, the results obtained vith the BrdU 
incorporation assay vere compared vith the classical 
25 mitogenic assay for Schwann cells based on [125]I-UdR 

incorporation into DKA of dividing cells, described by J. 
P. Brockes (Methods Bnzymol. 147:217, 1987). 

Fig. 12 shows the comparison of data obtained vith 
the tvo assays, performed in the same cell culture 
30 conditions (5,000 cells/veil, in 5% FBP/DMEM, incubated 
in presence of GGFs for 48 hrs) . As clearly shovn, the 
results are comparable, but BrdU incorporation assay 
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appears to be slightly store sensitive , as suggested by 
the shift of the curve to the left of the graph, i.e. to 
lover concentrations of GGFS* 

As described under the section "Methods of 
5 Mitogenesis Testing", after the immunoreactive BrdU-DNA 
has been quantitated by reading the intensity of the 
soluble product of the OPD peroxidase reaction, the 
original assay plates containing cell monolayers can 
undergo the second reaction resulting in the insoluble 

10 DAB product, which stains the BrdU positive nuclei. The 
microcultures can then be examined under an inverted 
microscope, and cell morphology and the numbers of 
BrdU-positive and negative nuclei can be observed. 
In Fig. 13 A and Fig. 13B the BrdU-DNA 

15 immunoreactivity, evaluated by reading absorbance at 490 
nm, is compared to the number of BrdU-positive nuclei and 
to the percentage of BrdU-positive nuclei on the total 
number of cells per veil, counted in the same cultures. 
Standard deviations vere less than 10%. The tvo 

20 evaluation methods shov a very good correlation and the 
discrepancy between the values at the highest dose of 
GGFs can be explained by the different extent of DNA 
synthesis in cells detected as BrdU-positive. 

The BrdU incorporation assay can therefore provide 

25 additional useful information about the biological 

activity of polypeptides on Schwann cells vhen compared 
to the (125) I-UdR incorporation assay. For example, the 
data reported in Fig. 15 show that GGFs can act on 
Schwann cells to induce DNA synthesis, but at lower doses 

30 to increase the number of negative cells present in the 
microculture after 48 hours. 

The assay has then been used on several cell lines 
of different origin. In Fig. 15 the mitogenic responses 
of Schwann cells and Swiss 3T3 fibroblasts to GGFs are 
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compared; despite the weak response btain d in 3T3 
fibroblasts, some clearly BrdU-positive nuclei were 
detected in these cultures. Control cultures were run in 
parallel in presence of several doses of FCS or human 
5 recombinant PDGF, shoving that the cells could respond to 
appropriate stimuli (not shown) • 

The ability of fibroblasts to respond to GGFs vas 
further investigated using the BHK 21 C13 cell line. 
These fibroblasts, derived from kidney, do not exhibit 
10 contact inhibition or reach a quiescent state when 

confluent. Therefore the experimental conditions vere 
designed to have a very low background proliferation 
without compromising the cell viability. GGFs have a 
significant mitogenic activity on BHK21 C13 cells as 
15 shown by Fig. 16 and Fig. 17. Fig. 16 shows the Brdu 

incorporation into DNA by BHK 21 C13 cells stimulated by 
GGFS in the presence of 0.1% FCS. The good mitogenic 
response to FCS indicates that cell culture conditions 
were not limiting. In Fig. 17 the mitogenic effect of 
20 GGFs is expressed as the number of BrdU-positive and 
BrdU~negative cells and as the total number of cells 
counted per well. Data are representative of two 
experiments run in duplicates; at least three fields per 
well were counted. As observed for Schwann cells in 
25 addition to a proliferative effect at low doses, GGFs 
also increase the numbers of nonresponding cells 
surviving. The percentage of BrdU positive cells is 
proportional to the increasing amounts of GGFs added to 
the cultures. The total number of cells after 48 hours 
30 in presence of higher doses of GGFs is at least doubled, 
confirming that GGFs induce DNA synthesis and 
proliferation in BHK21 C13 cells. Under the same 
conditions, cells maintained for 48 hours in the presence 
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of 2% FCS shoved an increase of about six fold (not 
shown) . 

C6 glioma cells have provided a useful model to 
study glial cell properties. The phenotype expressed 
5 seems to be dependent on the cell passage, the cells more 
closely resembling an astrocyte phenotype at an early 
stage, and an oligodendrocyte phenotype at later stages 
(beyond passage 70) . C6 cells used in these experiments 
were from passage 39 to passage 52. C6 cells are a 

10 highly proliferating population, therefore the 

experimental conditions were optimized to have a very lov 
background of BrdU incorporation. The presence of 0.1% 
serum was necessary to maintain cell viability without 
significantly affecting the mitogenic responses, as shown 

15 by the dose response to FCS (Fig. 18) . 

In Fig. 19 the mitogenic responses to aFGF (acidic 
Fibroblast growth factor) and GGFs are expressed as the 
percentages of maximal BrdU incorporation obtained in the 
presence of FCS (8%). Values are averages of two 

20 experiments, run in duplicates. The effect of GGFs was 
comparable to that of a pure preparation of aFGF* aFGF 
has been described as a specific growth factor for C6 
cells (Lim R. et al., Cell Regulation 1:741-746, 1990) 
and for that reason it was used as a positive control. 

25 The direct counting of BrdU positive and negative cells 
was not possible because of the high cell density in the 
microcultures . In contrast to the cell lines so far 
reported, PC12 cells did not show any evident 
responsiveness to GGFS, when treated under culture 

30 conditions in which PC12 could respond to sera (mixture 
of FCS and HS as used routinely for cell maintenance) . 
Nevertheless the number of cells plated per well seems to 
affect the behavior of PC12 cells, and therefore further 
experiments are required. 
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EXAMPLE 9 

Amino acid sequences of purified ggf -i and cgf-tt 
Amino acid sequence analysis studies were 
performed using highly purified bovine pituitary GGF-I 
5 and GGF-II. The conventional single letter code was used 
to describe the sequences. Peptides were obtained by 
lysyl endopeptidase and protease V8 digests, carried out 
on reduced and carboxymethylated samples, with the lysyl 
endopeptidase digest of GGF-II carried out on material 
10 eluted from the 55-65 RD region of a 11% SDS-PAGE (MW 
relative to the above-quoted markers) . 

A total of 21 peptide sequences (see Fig. 8, SEQ 
ID Nos. 1-20, 165) were obtained for GGF-I, of which 12 
peptides (see Fig. 9, SEQ ID Nos. 1, 22-29, 17, 19, and 
15 32) are not present in current protein databases and 
therefore represent unique sequences. A total of 12 
peptide sequences (see Fig. 10, SEQ ID Nos. 42-50 and 
161-163) were obtained for GGF-II, of which 10 peptides 
(see Fig. 11, SEQ ID Nos. 42-50) are not present in 
20 current protein databases and therefore represent unique 
sequences (an exception is peptide GGF-II 06 which shows 
identical sequences in many proteins which are probably 
of no significance given the small number of residues) . 
These novel sequences are extremely likely to correspond 
25 to portions of the true amino acid sequences of GGFs I 
and II. 

Particular attention can be drawn to the sequences 
of GGF-I 07 and GGF-II 12, which are clearly highly 
related. The similarities indicate that the sequences of 
30 these peptides are almost certainly those of the assigned 
GGF species, and are most unlikely to be derived from 
contaminant proteins. 

In addition, in peptide GGF-II 02, the sequence X 
S S is consistent with the presence of an N linked 
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carbohydrate moiety on an asparagine at the position 
denoted by X« 

In general, in Figs* 8 and 10, X represents an 
unknown residue denoting a sequencing cycle where a 
5 single position could not be called with certainty either 
because there was more than one signal of equal size in 
the cycle or because no signal was present. As asterisk 
denotes those peptides where the last amino acid called 
corresponds to the last amino acid present in that 

10 peptide. In the remaining peptides, the signal strength 
after the last amino acid called was insufficient to 
continue sequence calling to the end of that peptide. The 
right hand column indicates the results of a computer 
database search using the GCG package FASTA and TFASTA 

IS programs to analyze the NBRF and EMBL sequence databases. 
The name of a protein in this column denotes identity of 
a portion of its sequence with the peptide amino acid 
sequence called allowing a maximum of two mismatches. A 
question mark denotes three mismatches allowed. The 

20 abbreviations used are as follows: 

HMG-1 High Mobility Group protein-1 

HMG-2 High Mobility Group protein-2 

LH-alpha Luteinizing hormone alpha subunit 
LH-beta Luteinizing hormone beta subunit 

25 EXAMPLE 10 

isolating and cloning of Nucleotide Sequences encoding 
proteins conta ining 6GF-I and GGF-II peptides 

Isolation and cloning of the GGF-II nucleotide 
sequences was performed as outlined herein, using peptide 
30 sequence information and library screening, and was 

performed as set out below. It will be appreciated that 
the peptides of Figs* 10 and 11 can be used as the 
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starting point for is lati n and cloning f GGF-I 
sequences by following the techniques described herein. 
Indeed, Fig. 20, SEQ ID Nos. 51-84) shows possible 
degenerate oligonucleotide probes for this purpose, and 
5 Fig. 22, SEQ ID Nos. 86-115, lists possible PGR primers. 
DNA sequence and polypeptide sequence should be 
obtainable by this means as with GGF-II, and also DNA 
constructs and expression vectors incorporating such DNA 
sequence, host cells genetically altered by incorporating 
10 such constructs/ vectors, and protein obtainable by 
cultivating such host cells. The invention envisages 
such subject matter. 



Iu Design and Synthesis of oligonucleotide Probes and 

Primers 

15 Degenerate DNA oligomer probes were designed by 

backtranslating the amino acid sequences (derived from 
the peptides generated from purified GGF protein) into 
nucleotide sequences. Oligomers represented either the 
coding strand or the non-coding strand of the DNA 

20 sequence. When serine, arginine or leucine were included 
in the oligomer design, then two separate syntheses were 
prepared to avoid ambiguities. For example, serine was 
encoded by either TCN or AGY as in 537 and 538 or 609 and 
610. Similar codon splitting was done for arginine or 

25 leucine (e.g. 544, 545). DNA oligomers were synthesized 
on a Biosearch 8750 4 -column DNA synthesizer using 0- 
cyanoethyl chemistry operated at 0.2 micromole scale 
synthesis. Oligomers were cleaved off the column (500 
angstrom CpG resins) and deprotected in concentrated 

30 ammonium hydroxide for 6-24 hours at 55-60°C. 

Deprotected oligomers were dried tinder vacuum (Speedvac) 
and purified by electrophoresis in gels of 15% acrylamide 
(20 mono : 1 bis) , 50 mM Tris-borate-EDTA buffer 
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c ntaining 7M ur a. Full length oligomers w r d tected 
in the gels by UV shadowing, then the bands were excised 
and DMA oligomers eluted into 1.5 mis H20 for 4-16 hours 
with shaking. The eluate was dried, redissolved in 0.1 
5 ml H 2 0 and absorbance measurements were taken at 260nm. 

Concentrations were determined according to the 
following formula: 

(A 260 x units/ml) (60.6/length -xjiM) 

All oligomers were adjusted to 50 /Of concentration 
10 by addition of H 2 0. 

Degenerate probes designed as above are shorn in 
Fig. 20, SEQ ID MOS. 54-88. 

PGR primers were prepared by essentially the same 
procedures that were used for probes with the following 
15 modifications. Linkers of thirteen nucleotides 

containing restriction sites were included at the 5' ends 
of the degenerate oligomers for use in cloning into 
vectors. DNA synthesis was performed at 1 micromole 
scale using 1,000 angstrom CpG resins and inosine was 
20 used at positions where all four nucleotides were 
incorporated normally into degenerate probes. 
Purifications of PCR primers included an ethanol 
precipitation following the gel electrophoresis 
purification. 

25 tt. Libra ry construction and Screening 

A bovine genomic DMA library was purchased from 
Stratagene (Catalogue Number: 945701). The library 
contained 2 x 10 6 15-20kb Sau3Al partial bovine DNA 
fragments cloned into the vector lambda Dashll. A bovine 

30 total brain cDNA library was purchased from Clonetech 
(Catalogue Number: BL 10139) . Complementary DNA 
libraries were constructed (In Vitrogen; Stratagene) from 
mRNA prepared from bovine total brain, from bovine 



WO 94/26298 FCT/US94/05083 



- 51 - 

pituitary and from bovine posterior pituitary. In 
Vitrogen prepared two cONA libraries: one library was in 
the vector lambda glO, the other in vector pcDNAI (a 
plasmid library) . The Stratagene libraries were prepared 
5 in the vector lambda unizap. Collectively, the cDHA 

libraries contained 14 million primary recombinant phage* 

The bovine genomic library was plated on £• coli 
K12 host strain LB392 on 23 x 23 cm plates (Nunc) at 
150,000 to 200,000 phage plagues per plate. Each plate 

10 represented approximately one bovine genome equivalent. 
Following an overnight incubation at 37 *C, the plates 
were chilled and replicate filters were prepared 
according to procedures of Maniatis et al. (2:60-81). 
Four plague lifts were prepared from each plate onto 

15 uncharged nylon membranes (Pall Biodyne A or MSI 

Nitropure) • The DNA was immobilized onto the membranes 
by cross-linking under UV light for 5 minutes or, by 
baking at 80 *C under vacuum for two hours. DNA probes 
were labelled using T4 polynucleotide kinase (New England 

20 Biolabs) with gamma 32P ATP (New England Nuclear; 6500 
Ci/mmol) according to the specifications of the 
suppliers. Briefly, 50 pmols of degenerate DNA oligomer 
were incubated in the presence of 600 /iCi gamma 32 P-ATP 
and 5 units T4 polynucleotide kinase for 30 minutes at 

25 37 # C. Reactions were terminated, gel electrophoresis 
loading buffer was added and then radiolabeled probes 
were purified by electrophoresis. 32P labelled probes 
were excised from gel slices and eluted into water. 
Alternatively, DNA probes were labelled via PCR 

30 amplification by incorporation of a-32P-dATP or o-32P 

dCTP according to the protocol of Schowalter and Sommer, 
Anal. Biochem 177:90-94 (1989). Probes labelled in PCR 
reactions were purified by desalting on Sephadex G-150 
columns. 
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Prehybridization and hybridization were performed 
in GHC buffer (0.52 M NaPi, 7% SDS, 1% BSA, 1.5 mM EDTA, 
0.1 M NaCl 10 mg/ml tRNA) . Washing was performed in 
oligovash (160 ml 1 M Na 2 HP0 4 , 200 ml 20% SDS, 8.0 ml 0.5 
5 M EDTA, 100 ml 5M NaCl, 3632 ml H20) . Typically, 20 

filters (400 sq. centimeters each) representing replicate 
copies of ten bovine genome equivalents were incubated in 
200 al hybridization solution with 100 pmols of 
degenerate oligonucleotide probe (128-512 fold 

10 degenerate) . Hybridization was allowed to occur 

overnight at 5*C below the minimum melting temperature 
calculated for the degenerate probe. The calculation of 
minimum melting temperature assumes 2°C for an AT pair 
and 4°C for a GC pair. 

15 Filters were washed in repeated changes of 

oligowash at the hybridization temperatures four to five 
hours and finally, in 3.2M tetramethylammonium chloride, 
1% SDS twice for 30 min at a temperature dependent on the 
DMA probe length. For 20mers, the final wash temperature 

20 was 60 °C. Filters were mounted, then exposed to X-ray 
film (Kodak XAR5) using intensifying screens (Dupont 
Cronex Lightening Plus) . Usually, a three to five day 
film exposure at minus 80 *C was sufficient to detect 
duplicate signals in these library screens. Following 

25 analysis of the results, filters could be stripped and 
reprobed. Filters were stripped by incubating through 
two successive cycles of fifteen minutes in a microwave 
oven at full power in a solution of 1% SDS containing 
lOmM EDTA pH8. Filters were taken through at least three 

30 to four cycles of stripping and reprobing with various 
probes. 
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tti. Recombinant Phage Isolation, Growth and PNA 
Preparation 

These procedures followed standard protocol as 
described in Recombinant DNA (Maniatis et al 2:60-2:81)* 

5 TV. Analys is of Isolated Clones Using DNA Digestion and 
Southern Blots 

Recombinant Phage DMA samples (2 micrograms) were 
digested according to conditions recommended by the 
restriction endonuclease supplier (New England Biolabs) . 

10 Following a four hour incubation at 37 # C, the reactions 
products were precipitated in the presence of 0.1M sodium 
acetate and three volumes of ethanol. Precipitated DNA 
was collected by centrifugation, rinsed in 75% ethanol 
and dried. All resuspended samples were loaded onto 

15 agarose gels (typically 1% in TAE buffer; 0.04M Tris 
acetate, 0.002M EDTA) . Gel runs were at 1 volt per 
centimeter from 4 to 20 hours. Markers included lambda 
Hind III DNA fragments and/or 0X174HaeIII DNA fragments 
(New England Biolabs). The gels were stained with 0.5 

20 micrograms /ml of ethidium bromide and photographed. For 
southern blotting, DNA was first depurinated in the gel 
by treatment with 0.125 N HC1, denatured in 0.5 N NaOH 
and transferred in 2 Ox SSC (3M sodium chloride, 0.03 M 
sodium citrate) to uncharged nylon membranes. Blotting 

25 was done for € hours up to 24 hours, then the filters 
were neutralized in 0.5 Tris HC1 pH 7.5, 0.15 M sodium 
chloride, then rinsed briefly in 50 mM Tris-borate EDTA. 

For cross-linking, the filters were wrapped first 
in transparent plastic wrap, then the DNA side exposed 

30 for five minutes to an ultraviolet light. Hybridization 
and washing was performed as described for library 
screening (see section 2 of this Example) • For 
hybridization analysis to determine whether similar genes 
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exist in other species slight modifications were made. 
The DNA filter was purchased from Clonetech (Catalogue 
Number 7753*1) and contains 5 micrograms of EcoRI 
digested DNA from various species per lane. The probe 
5 was labelled by PGR amplification reactions as described 
in section 2 above, and hybridizations were done in 80% 
buffer B(2 g polyvinylpyrrolidine, 2 g Ficoll-400, 2 g 
bovine serum albumin, 50 ml 1M Tris-HCl (pH 7.5) 58 g 
NaCl, l g sodium pyrophosphate, 10 g sodium dodecyl 

10 sulfate, 950ml H 2 0) containing 10% dextran sulfate. The 
probes were denatured by boiling for ten minutes then 
rapidly cooling in ice water. The probe was added to the 
hybridization buffer at 10 6 dpm 32 P per ml and incubated 
overnight at 60°C. The filters were washed at 60°C first 

15 in buffer B followed by 2X SSC, 0.1% SOS then in lx SSC, 
0.1% SDS. For high stringency, experiments, final washes 
were done in 0.1 x SSC, 1% SOS and the temperature raised 
to 65 # C. 

Southern blot data were used to prepare a 
20 restriction map of the genomic clone and to indicate 
which subfragments hybridized to the GGF probes 
(candidates for subcloning) . 

V. subclonina of Segments of DNA Homologous to 

Hybridization Probes 

25 DNA digests (e.g. 5 micrograms) were loaded onto 

1% agarose gels then appropriate fragments excised from 
the gels following staining. The DNA was purified by 
adsorption onto glass beads followed by elution using the 
protocol described by the supplier (Bio 101) • Recovered 

30 DNA fragments (100-200 ng) were ligated into linearized 
dephosphorylated vectors, e.g. pT3T7 (Ambion) , which is a 
derivative of pUC18, using T4 ligase (New England 
Biolabs) . This vector carries the E. soil p lactamase 
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g ne, henc , transf rmants can be selected on plates 
containing ampicillin. The vector also supplies fi- 
galactosidase complementation to the host cell, therefore 
non-recombinants (blue) can be detected using 
5 isopropylthiogalactoside and Bluogal (Bethesda Research 
Labs) . A portion of the ligation reactions was used to 
transform £. coli K12 XL1 blue competent cells 
(Stratagene Catalogue Number: 200236) and then the 
transformants were selected on LB plates containing 50 
10 micrograms per ml ampicillin. White colonies were 

selected and plasmid mini preps were prepared for DNA 
digestion and for DNA sequence analysis. Selected clones 
were retested to determine if their insert DNA hybridized 
with the G6F probes. 

15 vi. DNA Sequencing 

Double stranded plasmid DNA templates were 
prepared from 5 ml cultures according to standard 
protocols. Sequencing was by the dideoxy chain 
termination method using Sequenase 2.0 and a 

20 dideoxynucleotide sequencing kit (US Biochemical) 

according to the manufacturers protocol (a modification 
of Sanger et al. PNAS; USA 21:5463 (1977)]. 
Alternatively, sequencing vas done in a DNA thermal 
cycler (Perkin Elmer, model 4800) using a cycle 

25 sequencing kit (New England Biolabs; Bethesda Research 
Laboratories) and vas performed according to 
manufacturers instructions using a 5 '-end labelled 
primer. Sequence primers were either those supplied 
with the sequencing kits or were synthesized according to 

30 sequence determined from the clones* Sequencing 

reactions were loaded on and resolved on 0.4mm thick 
sequencing gels of 6% polyacrylamide. Gels were dried 
and exposed to X-Ray film. Typically, 35S was 
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incorporated when standard sequencing kits were used and 
a 32P end labelled primer was used for cycle sequencing 
reactions. Sequences were read into a DNA sequence 
editor from the bottom of the gel to the top (5' 
5 direction to 3') and data were analyzed using programs 
supplied by Genetics Computer Group (GCG, University of 
Wisconsin) . 

XII* RHA Preparation and ?QR Anplifigatign 

Open reading frames detected in the genomic DNA 

10 and which contained sequence encoding GGF peptides were 
extended via PGR amplification of pituitary RNA« RNA was 
prepared from frozen bovine tissue (Pelfreeze) according 
to the guanidine neutral-CsCl procedure (Chirgvin at. al. 
Biochemistry 18:5294(1979).) Polyadenylated RNA was 

15 selected by oligo-dT cellulose column chromatography 
(Aviv and Leder FNAS (USA) 69:1408 (1972)). 

Specific DNA target sequences were amplified 
beginning with either total RNA or polyadenylated RNA 
samples that had been converted to cDNA using the Perkin 

20 Elmer PCR/RNA Kit Number: N808-0017. First strand 

reverse transcription reactions used 1 fig template RNA 
and either primers of oligo dT with restriction enzyme 
recognition site linkers attached or specific antisense 
primers determined from cloned sequences with restriction 

25 sites attached. To produce the second strand, the 

primers either were plus strand unique sequences as used 
in 3' RACE reactions (Frohman et. al., PNAS (USA) ££:8998 
(1988)) or were oligo dT primers with restriction sites 
attached if the second target site had been added by 

30 terminal transferase tailing first strand reaction 

products with dATP (e.g. 5' race reactions, Frohman et. 
al., ibid). Alternatively, as in anchored PGR reactions 
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the sec nd strand prim rs were deg nerat , hence, 
representing particular peptide sequences. 

The amplification profiles followed the following 
general scheme: 1) five minutes soak file at 95 °C; 2) 
5 thermal cycle file of 1 minute, 95°C; 1 minute ramped 
down to an annealing temperature of 45°C, 50 # C or 55°C; 
maintain the annealing temperature for one minute; ramp 
up to 72 # C over one minute; extend at 72 °C for one minute 
or for one minute plus a 10 second auto extension; 3) 

10 extension cycle at 72°C, five minutes, and; 4) soak file 
4°C for infinite time. Thermal cycle files (#2) usually 
were run for 30 cycles. A sixteen Ml sample of each 100 
til amplification reaction was analyzed by electrophoresis 
in 2% Nusieve 1% agarose gels run in TAB buffer at 4 

15 volts per centimeter for three hours. The gels were 

stained, then blotted to uncharged nylon membranes which 
were probed with labelled DNA probes that were internal 
to the primers. 

Specific sets of DNA amplification products could 

20 be identified in the blotting experiments and their 
positions used as a guide to purification and 
reamplification. When appropriate, the remaining 
portions of selected samples were loaded onto preparative 
gels, then following electrophoresis four to five slices 

25 of 0.5 mm thickness (bracketing the expected position of 
the specific product) were taken from the gel. The 
agarose was crushed, then soaked in 0.5 ml of 
electrophoresis buffer from 2-16 hours at 40 # C. The 
crushed agarose was centrifuged for two minutes and the 

30 aqueous phase was transferred to fresh tubes. 

Reamplification was done on five microliters 
(roughly 1% of the product) of the eluted material using 
the same sets of primers and the reaction profiles as in 
the original reactions. When the reamplification 
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reactions were completed, samples were extracted with 
chloroform and transferred to fresh tubes. Concentrated 
restriction enzyme buffers and enzymes were added to the 
reactions in order to cleave at the restriction sites 
5 present in the linkers. The digested PGR products were 
purified by gel electrophoresis, then subcloned into 
vectors as described in the subcloning section above. 
DMA sequencing was done described as above. 

VTTI, DNA Sequence Analysis 

10 DNA sequences were assembled using a fragment 

assembly program and the amino acid sequences deduced by 
the GCG programs GelAssemble, Map and Translate. The 
deduced protein sequences were used as a query sequence 
to search protein sequence databases using WordSearch. 

15 Analysis was done on a VAX Station 3100 workstation 

operating under VMS 5.1. The database search was done on 
SwissProt release number 21 using GCG Version 7.0. 

TX- Results of Clonin g and Sequencing of genes encoding 
GGF-T and GGF-II 

20 As indicated above, to identify the DNA sequence 

encoding bovine GGF-II degenerate oligonucleotide probes 
were designed from GGF-II peptide sequences. GGF-II 12 
(SEQ ID No. 44), a peptide generated via lysyl 
endopeptidase digestion of a purified GGF-II preparation 

25 (see Figs. 16 and 12) showed strong amino acid sequence 
homology with GGF-I 07 (SEQ ID No. 39) , a tryptic peptide 
generated from a purified GGF-I preparation. GGF-II 12 
was thus used to create ten degenerate oligonucleotide 
probes (see oligos 609, 610 and 649 to 656 in Fig. 20, 

30 SEQ ID Nos. 66, 67, 68 and 75, respectively). A 

duplicate set of filters were probed with two sets (set 
1-609, 610; set 2-649-5656) of probes encoding two 
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overlapping portions of GGF-II 12. Hybridization signals 
were observed, but, only one clone hybridized to both 
probe sets. The clone (designated 6GF2B61) was purified. 

Southern blot analysis of DNA from the phage clone 
5 GGF2BG1 confirmed that both sets of probes hybridized 
with that bovine DNA sequence, and shoved further that 
both probes reacted with the same set of DNA fragments 
within the clone. Based on those experiments a 4 Jcb Eco 
RI sub-fragment of the original clone was identified, 

10 subcloned and partially sequenced. Fig. 21 shows the 

nucleotide sequence, SEQ ID No. 89) and the deduced amino 
acid sequence of the initial DNA sequence readings that 
included the hybridization sites of probes 609 and 650, 
and confirmed that a portion of this bovine genomic DNA 

15 encoded peptide 12 (KASLADSGEYM) . 

Further sequence analysis demonstrated that GGF-II 
12 resided on a 66 amino acid open reading frame (see 
below) which has become the starting point for the 
isolation of overlapping sequences representing a 

20 putative bovine GGF-II gene and a cDNA. 

Several PGR procedures were used to obtain 
additional coding sequences for the putative bovine 
GGF-II gene. Total RNA and oligo dT-selected (poly A 
containing) RNA samples were prepared from bovine total 

25 pituitary, anterior pituitary, posterior pituitary, and 
hypothalamus. Using primers from the list shown in Fig. 
22, SEQ ID Nos. 109-119, one-sided PCR reactions (RACE) 
were used to amplify cDNA ends in both the 3' and 5' 
directions, and anchored PGR reactions were performed 

30 with degenerate oligonucleotide primers representing 
additional GGF-II peptides. Fig. 29 summarizes the 
contiguous DNA structures and sequences obtained in those 
experiments. From the 3 ' RACE reactions, three 
alternatively spliced cDNA sequences were produced, 
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which have been cloned and s quenc d. A 5' RACE 
reaction led to the discovery of an additional exon 
containing coding sequence for at least 52 amino acids. 
Analysis of that deduced amino acid sequence revealed 
5 peptides GGF-II-6 and a sequence similar to GGF-I-18 (see 
below) • The anchored PGR reactions led to the 
identification of (cDNA) coding sequences of peptides 
GGF-II-1, 2, 3 and 10 contained within an additional cDNA 
segment of 300 bp- The 5' limit of this segment (i.e., 

10 segment E, see Fig. 30) is defined by the oligonucleotide 
which encodes peptide GGF-II-1 and which was used in the 
PGR reaction (additional 5' sequence data exists as 
described for the human clone in Example 11) . Thus this 
clone contains nucleotide sequences encoding six out of 

15 the existing total of nine novel GGF-II peptide 
sequences . 

The cloned gene was characterized first by 
constructing a physical map of GGF2BG1 that allowed us to 
position the coding sequences as they were found (see 

20 below, Fig. 30) . DNA probes from the coding sequences 
described above have been used to identify further DNA 
fragments containing the exons on this phage clone and to 
identify clones that overlap in both directions • The 
putative bovine GGF-II gene is divided into at least 5 

25 coding segments. Coding segments are defined as discrete 
lengths of DNA sequence which can be translated into 
polypeptide sequences using the universal genetic code. 
The coding segments described in Fig. 36 and referred to 
in the present application are: 1) particular exons 

30 present within the GGF gene (e.g. coding segment a), or 
2) derived from sets of two or more exons that appear in 
specific sub-groups of mRNAs, where each set can be 
translated into the specific polypeptide segments as in 
the gene products shown. The polypeptide segments 
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r f rred to in th claims are the translation products of 
the analogous DNA coding segments • Only coding segments 
A and B have been defined as exons and sequenced and 
mapped thus far* The summary of the contiguous coding 
5 sequences identified is shown in Fig* 31* The exons are 
listed (alphabetically) in the order of their discovery. 
It is apparent from the intron/exon boundaries that exon 
B may be included in cDNAs that connect coding segment E 
and coding segment A. That is, exon B cannot be spliced 

10 out without compromising the reading frame* Therefore, 
we suggest that three alternative splicing patterns can 
produce putative bovine GGF-II cDNA sequences 1, 2 and 
3* The coding sequences of these, designated 
G6F2BPP1 • CDS , G6F2BPP2 . CDS and GGF2BPP3 • CDS , 

15 respectively, are given in Figs. 27A (SEQ ID No. 129) , 
27B (SEQ ID No. 130) , and 27C (SEQ ID No. 131) , 
respectively. The deduced amino acid sequence of the 
three cDNAs is also given in Figs. 27A, (SEQ ID No. 129) , 
27B (SEQ ID No. 130) , and 27C (SEQ ID No. 131) . 

20 The three deduced structures encode proteins of 

lengths 206, 281 and 257 amino acids. The first 183 
residues of the deduced protein sequence are identical in 
all three gene products. At position 184 the clones 
differ significantly. A codon for glycine GGT in 

25 66F2BPP1 also serves as a splice donor for G6F2BPP2 and 
G6F2BPP3, which alternatively add on exons C, C/D, C/D' 
and D or C, C/D and D, respectively, and shown in Fig. 
32, SEQ ID No. 145)* GGFIIBPP1 is a truncated gene 
product which is generated by reading past the coding 

30 segment A splice junction into the following intervening 
sequence (intron) • This represents coding segment A' in 
Fig. 30 (SEQ ID No. 136). The transcript ends adjacent 
to a canonical AATAAA polyadenylation sequence, and we 
suggest that this truncated gene product represents a 
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bona fide mature transcript. The other two longer gene 
products share the same 3' untranslated sequence and 
polyadenylation site. 

All three of these molecules contain six of the 
5 nine novel GGF-II peptide sequences (see Fig. 11) and 
another peptide is highly homologous to GGF-I-18 (see 
Fig. 26) . This finding gives a high probability that 
this recombinant molecule encodes at least a portion of 
bovine GGF-II. Furthermore , the calculated isoelectric 

10 points for the three peptides are consistent with the 

physical properties of GGF-I and II. Since the molecular 
size of GGF-II is roughly 60 kD, the longest of the three 
cDKAs should encode a protein with nearly one-half of the 
predicted number of amino acids. 

15 A probe encompassing the B and A exons was 

labelled via PGR amplification and used to screen a cDNA 
library made from RNA isolated from bovine posterior 
pituitary. One clone (GGF2BPP5) shoved the pattern 
indicated in Fig. 29 and contained an additional DNA 

20 coding segment (G) between coding segments A and c. The 
entire nucleic acid sequence is shown in Fig. 31 (SEQ ID 
No. 144) . The predicted translation product from the 
longest open reading frame is 241 amino acids. A portion 
of a second cDNA (GGF2BPP4) was also isolated from the 

25 bovine posterior pituitary library using the probe 

described above. This clone showed the pattern indicated 
in Fig. 29. This clone is incomplete at the 5' end, but 
is a splicing variant in the sense that it lacks coding 
segments G and D. BPP4 also displays a novel 3' end with 

30 regions H f K and L beyond region C/D. The sequence of 
BPP4 is shown in Fig. 33 (SEQ ID No. 146) . 

EXAMPLE 11 

GGF Sequences in V arious Species 
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The GGF proteins are th members of a new 
super family of proteins. In high stringency cross 
hybridization studies (DNA blotting experiments) with 
other mammalian DNAs ve have shown, clearly, that DNA 
5 probes from this bovine recombinant molecule can readily 
detect specific sequences in a variety of samples tested. 
A highly homologous sequence is also detected in human 
genomic DNA. The autoradiogram is shown in Fig. 28. The 
signals in the lanes containing rat and human DNA 
10 represent the rat and human equivalents of the GGF gene, 
the sequences of several cDNA's encoded by this gene have 
been recently reported by Holmes et al. (Science 
1205 (1992)) and Wen et al. (Cell £&: 559 (1992)). 



EXAMPLE 12 

15 Isolation of a Human Sequence Encoding Human GGF2 

Several human clones containing sequences from the 
bovine GGFII coding segment E were isolated by screening 
a human cDNA library prepared from brain stem (Stratagene 
catalog #935206). This strategy was pursued based on the 

20 strong link between most of the GGF2 peptides (unique to 
GGF2) and the predicted peptide sequence from clones 
containing the bovine E segment. This library was 
screened as described in Example 8, Section II using the 
oligonucleotide probes 914*919 listed below. 

25 914TCGGGCTCCATGAAGAAGATGTA (SEQ ID NO: 179) 

9 15TCCATGAAGAAGATGTACCTGCT (SEQ ID NO: 180) 

9 1 6ATGTACCTGCTGTCCTCCTTGA (SEQ ID NO: 181) 

9 17TTGAAGAAGGACTCGCTGCTCA (SEQ ID NO: 182) 

9 18 AAAGCCGGGGGCTTGAAGAA (SEQ ID NO: 183) 

30 9 19ATGARGTGTGGGCGGCGAAA (SEQ ID NO: 184) 

Clones detected with these probes were further 
analyzed by hybridization. A probe derived from coding 
segment A (see Fig. 30) , which was produced by labeling a 
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polymerase chain reaction (PCR) product from s gment A, 
was also used to screen the primary library. Several 
clones that hybridized with both A and E derived probes 
were selected and one particular clone, GGF2HBS5, was 
5 selected for further analysis. This clone is represented 
by the pattern of coding segments (EBACC/D'D as shown in 
Fig. 30). The B segment in this clone is the human 
equivalent of the truncated bovine version of £ shown in 
Fig. 30. G6F2HBS5 is the most likely candidate to encode 

10 GGF-II of all the "putative 1 * GGF-II candidates described. 
The length of coding sequence segment E is 786 
nucleotides plus 264 bases of untranslated sequence. The 
predicted size of the protein encoded by 66F2HBS5 is 
approximately 423 amino acids (approximately 45 

15 kilodaltons, see Fig. 44, SEQ ID NO: 21), which is 

similar to the size of the deglycosylated form of GGF-II 
(see Example 20) . Additionally, seven of the GGF-II 
peptides listed in Fig. 26 have equivalent sequences 
which fall within the protein sequence predicted from 

20 region E. Peptides II-6 and 11-12 are exceptions, which 
fall in coding segment B and coding segment A, 
respectively. RKA encoding the GGF2HBS5 protein was 
produced in an in vitro transcription system driven by 
the bacteriophage T7 promoter resident in the vector 

25 (Bluescript SK [Stratagene Inc.] see Fig. 47) containing 
the GGF2HBS5 insert. This RKA was translated in a cell 
free (rabbit reticulocyte) translation system and the 
size of the protein product was 45 Kd. Additionally, the 
cell-free product has been assayed in a Schwann cell 

30 mitogenic assay to confirm biological activity. Schwann 
cells treated with conditioned medium show both increased 
proliferation as measured by incorporation of 125 I-Uridine 
and phosphorylation on tyrosine of a protein in the 185 
fcilodalton range. 
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Thus the size f the product encod d by GGF2HBS5 
and the presence of DNA sequences which encode human 
peptides highly homologous to the bovine peptides shown 
in Fig. 11 confirm that GGF2HBS5 encodes the human 
5 equivalent of bovine GGF2. The fact that conditioned 
media prepared from cells transformed with this clone 
elicits Schwann cell mitogenic activity confirms that the 
GGFIIHBS5 gene produce (unlike the BPP5 gene product) is 
secreted. Additionally the GGFIIBPP5 gene product seems 
10 to mediate the Schwann cell proliferation response via a 
receptor tyrosine kinase such as pl85* rbB2 or a closely 
related receptor (see Example 19) . 

EXAMPLE 13 

Expression of Human Recombinant GGF2 in Mammalian and 

is inMct Cells 

The GGF2HBS5 cDNA clone encoding human GGF2 (as 
described in Example 12 and also referred to herein as 
HBS5) was cloned into vector pcDL-SRa296 and COS-7 cells 
were trans f ected in 100 mm dishes by the DEAE-dextran 

20 method. Cell lysates or conditioned media from 

transiently expressing COS cells were harvested at 3 or 4 
days post-transfection. To prepare lysates, cell 
monolayers were washed with PBS, scraped from the dishes 
lysed by three freeze/thaw cycles in 150 of 0.25 M 

25 Tris-HCl, pH8. Cell debris was pelleted and the 

supernatant recovered. Conditioned media samples (7 
mis.) were collected, then concentrated and buffer 
exchanged with 10 mm Tris, pH 7.4 using Centiprep-10 and 
Centricon-10 units as described by the manufactures 

30 (Amicon, Beverly, MA) . Rat nerve Schwann cells were 

assayed for incorporation of DNA synthesis precursors, as 
described. Conditioned media or cell lysate samples were 
tested in the Schwann cell proliferation assay as 
described in Marchionni et al., Nature 362:313 (1993). 
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The cDNA, GGF2HBS5, encoding GGF2 directed the secretion 
of the protein product to the medium. Minimal activity 
was detectable inside the cells as determined by assays 
using cell ly sates. GGF2HFB1 and GGFBPP5 cDNA's failed 
5 to direct the secretion of the product to the 

extracellular medium. GGF activity from these clones was 
detectable only in cell ly sates. 

Recombinant GGF2 was also expressed in GHO cells. 
The GGF2HBS5 cDNA encoding 6GF2 was cloned into the EcoRI 

10 site of vector pcdhfrpolyA and transfected into the DHFR 
negative GHO cell line (GG44) by the calcium phosphate 
coprecipitation method. Clones were selected in 
nucleotide and nucleoside free a medium (Gibco) in 96* 
veil plates. After 3 weeks, conditioned media samples 

15 from individual clones were screened for expression of 
GGF by the Schwann cell proliferation assay as described 
in Marchionni et al., Nature 362:313 (1993). Stable 
clones which secreted significant levels of GGF activity 
into the aedium were identified. Schwann cell 

20 proliferation activity data from different volume 

aliquots of GHO cell conditioned medium were used to 
produce the dose response curve shown in Fig. 46 (Graham 
and Van Der Eb, Virology 52:456, 1973). This material 
was analyzed on a Western blot probed with polyclonal 

25 antisera raised against a GGF2 specific peptide. A band 
of approximately 65 Kd (the expected size of GGF2 
extracted from pituitary) is specifically labeled (Fig. 
48, lane 12). 

Recombinant GGF2 was also expressed in insect 

30 cells using the Baculovirus expression. Sf 9 insect cells 
were infected with baculovirus containing the GGF2HBS5 
cDNA clone at a multiplicity of 3-5 (10 6 cells/ml) and 
cultured in Sf900-II medium. Schwann cell mitogenic 
activity was secreted into the extracellular medium. 
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Different volumes of insect cell conditioned medium were 
tested in the Schwann cell proliferation assay in the 
absence of forskolin and the data used to produce a dose 
response curve. 

5 This material was also analyzed on a Western blot 

(Fig. 45B) probed with the 6GF II specific antibody 
described above* 

The methods used in this example were as follows: 
Schwann cell mitogenic activity of recombinant 
10 human and bovine glial growth factors was determined as 
follows: Mitogenic responses of cultured Schwann cells 
were measured in the presence of 5 mM forskolin using 
crude recombinant GGF preparations obtained from 
transient mammalian expression experiments. 
15 Incorporation of [ 125 I]-Urd was determined following an 
18-24 hour exposure to materials obtained from 
transfected or mock transfected cos cells as described in 
the Methods. The mean and standard deviation of four 
sets of data are shown. The mitogenic response to 
20 partially purified native bovine pituitary GGF 

(carboxymethyl cellulose fraction; Goodearl et al., 
submitted) is shown (GGF) as a standard of one hundred 
percent activity. 

cONAs (Fig. 46, SEQ ID NOs. 166-168) were cloned 
25 into pcDL-SRcr296 (Takebe et al., Mol. Cell Biol. 8:466- 
472 (1988)), and COS-7 cells were transfected in 100 mm 
dishes by the DEAE-dextran method (Sambrook et al., In 
Molecular Cloning. A Laboratory Manual, 2nd. ad. (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
30 1989)). Cell lysates or conditioned media were harvested 
at 3 or 4 days post -trans feet ion. To prepare lysates, 
cell monolayers were washed with PBS, scraped from the 
dishes, and lysed by three freeze/ than cycles in 150 Ml 
of 0.25 M Tris-HCl, pH 8. Cell debris was pelleted and 
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the supemate recovered. Conditioned media samples (7 
mis) were collected, then concentrated and buffer 
exchanged with 10 mM Tris, pH 7.4 using Centriprep-10 and 
Centricon-10 units are described by the manufacturers 
5 (Amicon, Beverly, MA) • Rat sciatic nerve Schwann cells 
were assayed for incorporation of DNA synthesis 
precursors, as described (Davis and Stroobant, J, Cell 
Biol. 110:1353-1360 (1990); Brockes et al., Brain Res. 
165:105-118 (1979)). 

10 Western blot of recombinant GHO cell conditioned 

medium were performed as follows: A recombinant GHO 
clone was cultured in MCDB302 protein-free for 3 days. 2 
ml of conditioned medium was harvested, concentrated, 
buffered exchanged against 10 mM Tris-BCl, pH 7.4 and 

15 lyophilized to dryness. The pellet was resuspended in 
SDS-PA6E sample buffer, subjected to reducing SDS gel 
electrophoresis and analyzed by Western blotting with a 
GGF peptide antibody. A CHO control was done by using 
conditioned medium from untransfected CHO-DG44 host and 

20 the CHO HBS5 levels were assayed using conditioned medium 
from a recombinant clone. 

EXAMPLE 14 

Identification of Functional Elements of GGF 

The deduced structures of the family of GGF 

25 sequences indicate that the longest forms (as represented 
by GGF2BPP4) encode transmembrane proteins where the 
extracellular part contains a domain which resembles 
epidermal growth factor (see Carpenter and Wahl in 
Peptide Growth Factors and Their Receptors I pp. 69-133, 

30 Springer-Verlag, NY 1991) . The positions of the cysteine 
residues in coding segments C and C/D or C/D' peptide 
sequence are conserved with respect to the analogous 
residues in the epidermal growth factor (EGF) peptide 
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sequenc (s e Fig, 32, SEQ ID Nos. 147-149). This 
suggests that the extracellular domain functions as 
receptor recognition and biological activation sites. 
Several of the variant forms lack the H, K, and L coding 
5 segments and thus may be expressed as secreted, 
diffusible biologically active proteins. GGF DNA 
sequences encoding polypeptides which encompass the EGF- 
like domain (EGFL) can have full biological activity for 
stimulating glial cell mitogenic activity. 

10 Membrane bound versions of this protein may induce 

Schwann cell proliferation if expressed on the surface of 
neurons during embryogenesis or during nerve regeneration 
(where the surfaces of neurons are intimately associated 
with the surfaces of proliferating Schwann cells) . 

15 Secreted (non membrane bound) GGFs may act as 

classically diffusible factors which can interact with 
Schwann cells at some distance from their point of 
secretion. Other forms may be released from intracells 
by sources via tissue injury and cell disruption. An 

20 example of a secreted GGF is the protein encoded by 

GGF2HBS5; this is the only GGF known which has been found 
to be directed to the exterior of the cell. Secretion is 
probably mediated via an N- terminal hydrophobic sequence 
found only in region E, which is the N-terminal domain 

25 contained within recombinant GGF2 encoded by GGF2HBS5. 

Other GGF's appear to be non-secreted. These GGFs 
may be injury response forms which are released as a 
consequence of tissue damage. 

Other regions of the predicted protein structure 

30 of GGF2 (encoded by GGF2HBS5) and other proteins 

containing regions B and A exhibit similarities to the 
human basement membrane heparan sulfate proteoglycan core 
protein. The peptide ADSGEY, which is located next to 
the second cysteine of the C2 immunoglobulin fold in 
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these GGF's, occurs in nine of twenty-two C-2 repeats 
found in that basal lamina protein. This evidence 
strongly suggests that these proteins may associate with 
matrix proteins such as those associated with neurons and 
5 glia, and may suggest a method for sequestration of glial 
growth factors at target sites. 

EXAMPLE IS 

Purificat ion of GGPa from Recombinant Cells 

In order to obtain full length or portions of GGFs 

10 to assay for biological activity , the proteins can be 

overproduced using cloned DNA. Several approaches can be 
used. A recombinant £. coli cell containing the 
sequences described above can be constructed. Expression 
systems such as pNH8a or pHH16a (Stratagene, Inc.) can be 

15 used for this purpose by following manufacturers 
procedures. Alternatively, these sequences can be 
inserted in a mammalian expression vector and an 
overproducing cell line can be constructed. As an 
example, for this purpose DNA encoding a GGF, clone 

20 6GF2BPP5 has been expressed in COS cells and can be 

expressed in Chinese hamster ovary cells using the pMSXND 
expression vector (Lee and Nathans, J. Biol. Chem. 
3521-3527, (1981)). This vector containing GGF DNA 
sequences can be transfected into host cells using 

25 established procedures. 

Transient expression can be examined or 
G4 18 -resistant clones can be grown in the presence of 
methotrexate to select for cells that amplify the dhfr 
gene (contained on the pMSXND vector) and, in the 

30 process, co-amplify the adjacent GGF protein encoding 
sequence. Because CHO cells can be maintained in a 
totally protein-free medium (Hamilton and Ham, In Vitro 
12, 537-547 (1977)), the desired protein can be purified 
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from the medium* Western analysis using the antisera 
produced in Example 17 can be used to detect the presence 
of the desired protein in the conditioned medium of the 
overproducing cells. 
5 The desired protein (rGGF2) was purified from the 

medium conditioned by transiently expressing cos cells as 
follows* rGGF II was harvested from the conditioned 
medium and partially purified using Cation Exchange 
Chromatography (POROS-HS) . The column was equilibrated 

10 with 33.3 mM MES pH 6.0. Conditioned media was loaded at 
flow rate of 10 ml/min. The peak containing Schwann cell 
proliferation activity and immunoreactive (using the 
polyclonal antisera was against a G6F2 peptide described 
above) was eluted with 50 mM Trie, 1M NaCl pH 8.0. 

15 rhGGF2 is also expressed using a stable Chinese 

Ovary Hamster cell line. rGGF2 from the harvested 
conditioned media was partially purified using Cation 
Exchange Chromatograph (POROS-HS) • The column was 
equilibrated with PBS pH 7.4. Conditioned media was 

20 loaded at 10 ml/min. The peak containing the Schwann 
Cell Proliferative activity and immunoreactivity (using 
GGF2 polyclonal antisera) was eluted with 50 mM Hepes, 
500 mM NaCl pH 8.0. An additional peak was observed at 
50 mM Hepes, 1M NaCl pH 8.0 with both proliferation as 

25 well as immunoreactivity (Fig. 45) . 

rh66F2 can be further purified using Hydrophobic 
Interaction Chromatography as a high resolution step; 
Cation exchange/Reserve phase Chromatography (if needed 
as second high resolution step) ; A viral inactivation 

30 step and a DNA removal step such as Anion exchange 
chromatography. 

Schwann Cell Proliferation Activity of recombinant 
6GF2 peak eluted from the Cation Exchange column was 
determined as follows: Mitogenic responses of the 
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cultured Schwann cells were measured in the presence of 5 
M Forskolin using the peak eluted by 50 mM Trie 1 M NaCl 
pH 8.0. The peak was added at 20 1, 10 1 (1:10) 10 1 and 
(1:100) 10 l. Incorporation of 125 I-Uridine was 
5 determined and expressed as (CPM) following an 18-24 hour 
exposure. 

An immunoblot using polyclonal antibody raised 
against a peptide of 6GF2 was carried out as follows: 10 
1 of different fractions were ran on 4-12% gradient gels, 

10 The gels were transferred on to Nitrocellulose paper , and 
the nitrocellulose blots were blocked with 5% BSA and 
probed with GGF2-specif ic antibody (1:250 dilution). 125 I 
protein A (1:500 dilution, Specific Activity - 9.0/Ci/g) 
was used as the secondary antibody. The immunoblots were 

15 exposed to Kodax X-Ray films for 6 hours. The peak 

fractions eluted with 1 M NaCl showed an immunoreactive 
band at 69K. 

G6F2 purification on cation exchange columns was 
performed as follows: GHO cell conditioned media 

20 expressing rGGFII was loaded on the cation exchange 

column at 10 ml/min. The column was equilibrated with 
PBS pH 7.4. The elution was achieved with 50 mM Hepes 
500 mM NaCl pH 8.0 and 50 mM Hepes 1M NaCl pH 8.0 
respectively. All fractions were analyzed using the 

25 Schwann cell proliferation assay (CPM) described herein. 
The protein concentration (mg/ml) was determined by the 
Bradford assay using BSA as the standard. 

A Western blot using 10 1 of each fraction was 
performed and immunoreactivity and the Schwann cell 

30 activity were observed to co-migrate. 

The protein may be assayed at various points in 
the procedure using a Western blot assay. Alternatively, 
the Schwann cell mitogenic assay described herein may be 
used to assay the expressed product of the full length 
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cl n or any bi logically active p rtions ther of. The 
full length clone GGP2BPP5 has been expressed transiently 
in COS cells. Intracellular extracts of transfected COS 
cells show biological activity when assayed in the 
5 Schwann cell proliferation assay described in Example 8. 
In addition, the full length close encoding 6GF2HBS5 has 
been expressed transiently in COS cells. In this case 
both cell extract and conditioned media show biological 
activity in the Schwann cell proliferation assay 

10 described in Example 8. Any member of the family of 

splicing variant complementary DNA's derived from the GGF 
gene (including the Heregulins) can be expressed in this 
manner and assayed in the Schwann cell proliferation 
assay by one skilled in the art. 

15 Alternatively, recombinant material may be 

isolated from other variants according to Wen et al. 
(Cell £2:559 (1992)) who expressed the splicing variant 
Neu differentiation factor (NDF) in COS-7 cells. cDNA 
clones inserted in the pJT-2 eukaryotic plasmid vector 

20 are under the control of the SV40 early promoter, and are 
3 '-flanked with the SV40 termination and polyadenylation 
signals. COS-7 cells were transfected with the pJT-2 
plasmid DHA by electroporation as follows: 6 x 10 s cells 
(in 0.8 ml of DMEM and 10% FEBS) were transferred to a 

25 0.4 cm cuvette and mixed with 20 /tg of plasmid DHA in 10 
fil of TE solution (10 mM Tris-HCl (pH 8.0), 1 mM EDTA) . 
Electroporation was performed at room temperature at 1600 
V and 25 fiF using a Bio-Rad Gene Pulser apparatus with 
the pulse controller unit set at 200 ohms. The cells 

30 were then diluted into 20 ml of DMEM, 10% FBS and 

transferred into a T75 flask (Falcon) . After 14 hr. of 
incubation at 37 °C, the medium was replaced with DMEM, 1% 
FBS, and the incubation continued for an additional 48 
hr. Conditioned medium containing recombinant protein 
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which was harvested from the cells demonstrated 
biological activity in a cell line expressing the 
receptor for this protein. This cell line (cultured 
human breast carcinoma cell line AU 565) was treated with 
5 recombinant material. The treated cells exhibited a 
morphology change which is characteristic of the 
activation of the erbB2 receptor. Conditioned medium of 
this type also can be tested in the Schwann cell 
proliferation assay. 

10 EXAMPLE 1$ 

H-terminal sequence analysis 
The cDNA encoding hGGF2 was cloned into the 
amplif iable vector pcdhfrpolyA and transfected into CHO- 
DG44 cells for stable expression. rhGGF2 is secreted 

15 into the conditioned media. The ability of the 

recombinant G6F2 to be secreted is presumably mediated 
through the N-terminal hydrophobic stretch (signal 
sequence) . A signal sequence, once having initiated 
export of a growing protein chain across the rough 

20 endoplasmic reticulum, is cleaved from the mature protein 
at a specific site. N-terminal sequence analysis of the 
expressed and purified rhGGF2 indicates the site of 
cleavage as shown below. The sequence of the first 50 
amino acid residues at the N-terminus of the protein was 
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c nf irmed by N-terminal sequence analysis (Table 5) , 

below. 

TABLE 5 

N-terminal sequence analysis of rhGGF2 





Priiiary 
Sequence 


pMoles 


i 


Gly (G) 


210.6 


2 


Asn (N) 


163 


3 


GLU (E) 


149 


4 


Ala (A) 


220 


5 


Ala (A) 


180 


6 


Pro (P) 


173 


7 


Ala (A) 


177 


8 


Gly (G) 


154.9 


9 


Ala (A) 


162.4 


10 


Ser (S) 


65.4 


11 


val (V) 


132.7 


12 


Val (V) *(Cys) 


11.7 


13 


Tyr (Y) 


112.7 


14 


Ser (S) 


47.6 


15 


Ser (S) 


27.1 



10 



15 



20 



Procass 

*Cys residues are destroyed by the Sdman Degradation Process and 
cannot ba datectsd 



25 The following sequence (SEQ ZD NO: 185) represents 

the amino acid sequence of hGGF2« The shaded area 
indicates the cleaved signal sequence* 



30 



m 



GNEAAPAGAS VCYSSPPSVG SVQELAQRAA WIEGKVHPQ RRQQGALDRK 
AAAAAGEAGA HGGDREPPAA GPRALGPPAE EPLZAANGTV PSWPTAPVPS 
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AGEPGEEAPY LVKVHQVWAV KAGGLKKDSL LTVRLGTWGH PAFPSCGRLK 
EDSRYIFFME PDANSTSRAP AAFRASFPPL ETGRNLKKEV SRVLCKRCAL 
PPQLKEMKSQ ESAAGSKLVL RCETSSEY5S LRFKWFKNGN ELNRKNKPQN 
IKIQKKPGKS ELRINKASLA DSGEYMCKVI SKLGNDSASA NITIVE5NAT 
5 STSTTGTSHL VKCAEKEKTP CVNGGECFMV KDLSNPSRYL CKCPNEFTGD 
RCQKYVMASF YSTSTPFLSL PE (SEQ ID NO: 185) 

The shaded area represents experimentally determined 15 
amino acid residues at the N- terminal of the rhGGF2, 
indicating As 0 -G 51 bond to be the cleavage site for the 
10 signal sequence. 

EXAMPLE 17 
Isolation of a Further Splicing Variant 

Methods for updating other neuregulins descsribed 
in U.S. patent application Serial No. 07/965,173, filed 

15 October 23, 1992, incorporated herein by reference, 

produced four closely related sequences (heregulin a, £1, 
02, 03) which arise as a result of splicing variation. 
Peles et al. (Cell £2:205 (1992)), and Wen et al. (Cell 
£2:559 (1992)) have isolated another splicing variant 

20 (from rat) using a similar purification and cloning 
approach to that described in Examples 1*9 and 11 
involving a protein which binds to p!85* rbB2 . The cDNA 
clone was obtained as follows (via the purification and 
sequencing of a pl85* rbB2 binding protein from a 

25 transformed rat fibroblast cell line) • 

A pl85 #rbB2 binding protein was purified from conditioned 
medium as follows. Pooled conditioned medium from three 
harvests of 500 roller bottles (120 liters total) was 
cleared by filtration through 0.2 fi filters and 

30 concentrated 31-fold with a Pelicon ultrafiltration 

system using membranes with a 20kd molecular size cutoff. 
All the purification steps were performed by using a 
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Pharmacia fast pr t in liquid chromatography system. The 
concentrated material was directly loaded on a column of 
heparin-Sepharose (150 ml, preequilibrated with 
phosphate-buffered saline (PBS) ) . The column vas washed 
5 with PBS containing 0.2 M NaCl until no absorbance at 280 
nm wavelength could be detected. Bound proteins were 
then eluted with a continuous gradient (250 ml) of NaCl 
(from 0.2 M to 1,0 M), and 5 ml fractions were collected. 
Samples (0.01 ml of the collected fractions were used for 
10 the quantitative assay of the kinase stimulatory 

activity. Active fractions from three column runs (total 
volume - 360 ml) were pooled, concentrated to 25 ml by 
using a YM10 ultrafiltration membrane (Amicon, Danvers , 
MA) , and ammonium sulfate was added to reach a 
15 concentration of 1.7 M. After clearance by 

centrifugation (10,000 x g, 15 min.), the pooled material 
was loaded on a phenyl-Superose column (HR10/10, 
Pharmacia) • The column was developed with a 45 ml 
gradient of (NH 4 ) 2 S0 4 (from 1.7 M to no salt) in 0.1 M 
20 Na 2 P0 4 (pH 7.4), and 2 ml fractions were collected and 
assayed (0.002 ml per sample) for kinase stimulation (as 
described in Example 19) . The major peak of activity was 
pooled and dialyzed against 50 mM sodium phosphate buffer 
(pH 7.3). A Mono-S cation-exchange column (HR5/5, 
25 Pharmacia) was preequilibrated with 50 mM sodium 

phosphate. After loading the active material (0.884 mg 
of protein; 35 ml) , the column was washed with the 
starting buffer and then developed at a rate of 1 ml/min. 
_ with a gradient of NaCl. The kinase stimulatory activity 
30 was recovered at 0.45*0.55 H salt and was spread over 
four fractions of 2 ml each. These were pooled and 
loaded directly on a Cu +2 chelating columns (1.6 ml, HR2/5 
chelating Superose, Pharmacia). Most of the proteins 
adsorbed to the resin, but they gradually eluted with a 
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30 ml lin ar gradient f ammonium chloride (0-1 M) . The 
activity eluted in a single peak of protein at the range 
of 0.05 to 0.2 M NH 4 C1. Samples from various steps of 
purification were analyzed by gel electrophoresis 
5 followed by silver staining using a Kit from IGN (Costa 
Mesa, GA) , and their protein contents were determined 
with a Coomassie blue dye binding assay using a kit from 
Bio-Rad (Richmond, CA) . 

The p44 protein (10 fig) was reconstituted in 200 

10 ill of 0.1 M ammonium bicarbonate buffer (pH 7.8). 
Digestion was conducted with L-l-tosyl -amide 
2-phenylethyl chloromethyl ketone- treated trypsin (Serva) 
at 37 *C for 18 hr. at an enzyme-to-substrate ratio of 
1:10. The resulting peptide mixture was separated by 

15 reverse-phase HPLC and monitored at 215 nm using a Vydac 
C4 micro column (2.1 mm i.d. x 15 cm, 300 A) and an HP 
1090 liquid chromatographic system equipped with a 
diode-array detector and a workstation. The column was 
equilibrated with 0.1% trif luoroacetic acid (mobile phase 

20 A) , and elution was effected with a linear gradient from 
0%-55% mobile phase B (90% acetonitrile in 0.1% 
trif luoroacetic acid) over 70 min. The flow rate was 0.2 
ml/min. and the column temperature was controlled at 
25 *C. One-third aliquots of the peptide peaks collected 

25 manually from the HPLC system were characterized by 

N-terminal sequence analysis by Edman degradation. The 
fraction eluted after 27.7 min. (T27.7) contained mixed 
amino acid sequences and was further rechromatographed 
after reduction as follows: A 70% aliquot of the peptide 

30 fraction was dried in vacuo and reconstituted in 100 jil 
of 0.2 M ammonium bicarbonate buffer (pH 7.8). DTT 
(final concentration 2 mH) was added to the solution, 
which was then incubated at 37 °C for 30 min. The reduced 
peptide mixture was then separated by reverse-phase HPLC 
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using a Vydac column (2*1 mm i.d. x 15 cm). Elution 
conditions and flow rat 'were identical to those described 
above. Amino acid sequence analysis of the peptide was 
performed with a Model 477 protein sequencer (Applied 
5 Biosystems, Inc., Foster City, CA) equipped with an 

on-line phenylthiohydantoin (PTH) amino acid analyzer and 
a Model 900 data analysis system (Hunkapiller et al. 
(1986) in Methods Qt Protein BigrggharagteriMtipni J.e. 
Shively, ed. (Clifton, New Jersey: Humana Press p. 223- 

10 247). The protein was loaded onto a trifluoroacetic 
acid-treated glass fiber disc precycled with polybrene 
and NaCl. The PTH-amino acid analysis vas performed with 
a micro liquid chromatography system (Model 120) using 
dual syringe pumps and reverse-phase (C-18) narrow bore 

15 columns (Applied Biosystems, 2.1 m x 250 mm). 
RHA was isolated from Ratl-EJ cells by standard 
procedures (Maniatis et al., Molecular cloning: A 
Laboratory Manual (Cold Spring Harbor, New York (1982) 
and poly (A) + vas selected using an ARNA Separator kit 

20 (Clontech Lab, Inc., Palo Alto, CA) . cDNA vas 

synthesized with the Superscript kit (from BRL Life 
Technologies, Inc., Bethesda, MD) • Column-fractionated 
double-strand cDNA vas ligated into an Sail- and 
Notl-digested pJT-2 plasmid vector, a derivative of the 

25 pCD-X vector (Okayama and Berg, Mol. Cell Biol. 2: 280 
(1983)) and transformed into DH10B £• soil cells by 
electroporation (Dover et al., Nucl. Acids Res. 1£: 6127 
(1988)). Approximately 5 x 10 5 primary transformants 
vere screened with tvo oligonucleotide probes that vere 

30 derived from the protein sequences of the N-terminus of 
NDF (residues 5-24} and the T40.4 tryptic peptide 
(residues 7-12) . Their respective sequences vere as 
follows (N indicates all 4 nt) : 
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(1) 5'-ATA GGG AAG GGC GGG GGA AGG GTC NCC CTC NGC 
AT 

AGG GCC GGG CTT GCC TCT GGA GCC TCT-3' 

(2) 5'-TTT ACA CAT ATA TTC NCC-3' 
5 C G G C 

(1: SEQ ID No. 163; 2: SEQ ID No. 164) 

The synthetic oligonucleotides were end-labeled with 
[y- 32 P]ATP with T4 polynucleotide kinase and used to 
screen replicate sets of nitrocellulose filters. The 

10 hybridization solution contained 6 x SSC, 50 mM sodium 
phosphate (pH 6.8) , 0.1% sodium pyrophosphate, 2 x 
Denhardt's solution, SO fig /ml salmon sperm DNA, and 20% 
formamide (for probe 1) or no formamide (for probe 2). 
The filters vere washed at either 50 # C with 0.5 x SSC, 

15 0.2% SDS, 2 IBM 2DTA (for probe 1) or at 37*C with 2 x 

SSC, 0.2% SDS, 2 mM EDTA (for probe 2). Autoradiography 
of the filters gave ten clones that hybridized with both 
probes. These clones were purified by replating and 
probe hybridization as described above. 

20 The cDNA clones were sequenced using an Applied 

Biosystems 3 73 A automated DNA sequencer and Applied 
Biosystems Taq DyeDeoxy" Terminator cycle sequencing kits 
following the manufacture's instructions. In some 
instances, sequences were obtained using [ 35 S]dATP 

25 (Amersham) and Sequenase* kits from U.S. Biochemicals 

following the manufacturer's instructions. Both strands 
of the cDNA clone 44 were sequenced by using synthetic 
oligonucleotides as primers. The sequence of the most 5' 
350 nt was determined in seven independent cDNA clones. 

30 The resultant clone demonstrated the pattern shown in 
Fig. 27 (NDF). 
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EXAMPLE 1? 

Purification and A88av of Other Proteins w hich bind 
pl8 g«rbB3 R g gwtgr 



I* — Purification of cre3 0 and p70 
5 Lupu at al. (Science 2A2L, 1552 (1990)) and Lippman 

and Lupu (patent application number PCT/US91/03443 
(1990)), hereby incorporated by reference, have purified 
a protein from conditioned media of a human breast cancer 
cell line MDA-MB-231. 

1° Lupu et al. (Proc. Natl. Acad. Sci. fll, 2287 

(1992)) purified another protein which binds to the 
piss**** 82 receptor. This particular protein, p75, was 
purified from conditioned medium used for the growth of 
SKBr-3 (a human breast cancer cell line) propagated in 

15 improved Eagle's medium (IMEM: 6IBC0) supplemented with 
10% fetal bovine serum (GIBCO) * 

II. Other nias«rbB2 M„ MnAtf 

Peles et al. (Cell £2, 205 (1992)) have also 
purified a i85* rbB2 stimulating ligand from rat cells. 

20 Holmes et al. (Science 1205 (1992)) have purified 

Heregulin a from human cells which binds and stimulates 
IBS* 1 ** 2 (see Example 5). Tarakovsky et al. Oncogene 
6:218 (1991) have demonstrated bending of a 25 kD 
polypeptide isolated from activated macrophages to the 

25 Neu receptor, a pl85* rbB2 homology, herein incorporated by 
reference. 



in. ffl?r isolation 

Yarden and Peles (Biochemistry 1Q, 3543 (1991)) 
have identified a 35 kilodalton glycoprotein which will 
30 stimulate the i85* rbB2 receptor. 
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In other publications, Davis et al. (Biochem. 
Biophys. Res. Coamun. 179 , 1536 (1991), Proc. Natl. Acad. 
Sci. £&, 8582 (1991) and Greene et al., PCT patent 
application PCT/US91/02331 (1990)) describe the 
5 purification of a protein from conditioned medium of a 
human T-cell (ATL-2) cell line. 

Huang et al. (1992, J. Biol. Chen. 257:11508- 
11512) , hereby incorporated by reference, have isolated 
an additional neu/erb B2 ligand growth factor from bovine 

10 kidney. The 25 kD polypeptide factor was isolated by a 
procedure of column fractionation, followed by sequential 
column chromatography on DEAE/cellulose (DE52), Sulfadex 
(sulfated Sephadex G-50) , heparin-Sepharose 4B, and 
Superdex 75 (fast protein liquid chromatography) • The 

15 factor, NEL-GF, stimulates tyrosine -spec if ic 

autophosphorylation of the neu/erb B2 gene product. 

EL Purification of acetylcholine receptor inducing 

activity (ARIA) 

ARIA, a 42 kD protein which stimulates 

20 acetylcholine receptor synthesis, has been isolated in 
the laboratory of Gerald Fischbach (Falls et al., (1993) 
Cell 72:801-815). ARIA induces tyrosine phosphorylation 
of a 185 Rda muscle transmembrane protein which resembles 
pl85 arbB2 , and stimulates acetylcholine receptor synthesis 

25 in cultured embryonic myotubes. ARIA is most likely a 
member of the GGF/erbB2 ligand group of proteins, and 
this is potentially useful in the glial cell mitogenesis 
stimulation and other applications of, e.g., GGF2 
described herein. 
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EXAMPLE 19 

Protein tyrosine phosphorylation mediated b Y ccy 

Rat Schwann cells, following treatment with 
sufficient levels of Glial Growth Factor to induce 
5 proliferation, show stimulation of protein tyrosine 
phosphorylation. Varying amounts of partially purified 
GGF were applied to a primary culture of rat Schwann 
cells according to the procedure outlined in Example 9. 
Schwann cells were grown in DMEM/10% fetal calf serum/ 5 
10 mm forskolin/0.5/ig per mL GGF-GM (O.SmL per well) in poly 
D-lysine coated 24 well plates. When confluent, the 
cells were fed with DMEM/10% fetal calf serum at 0*5mL 
per well and left in the incubator overnight to quiesce. 
The following day, the cells were fed with 0.2mL of 
15 DMEM/10% fetal calf serum and left in the incubator for 1 
hour. Test samples were then added directly to the 
medium at different concentrations and for different 
lengths of time as required. The cells were then lysed 
in boiling lysis buffer (sodium phosphate, 5mM, pH 6.8; 
20 SDS, 2%, 0-mercapteothanol, 5%; dithiothreitol, 0.1M; 
glycerol, 10%; Bromophenol Blue, 0.4%; sodium vanadate, 
10mM) , incubated in a boiling water bath for 10 minutes 
and then either analyzed directly or frozen at -70 # C. 
Samples were analyzed by running on 7.5% SDS-PAGE gels 
25 and then electroblotting onto nitrocellulose using 

standard procedures as described by Towbin et al. (1979) 
Proc. Natl. Acad. Sci. USA 76:4350*4354. The blotted 
nitrocellulose was probed with antiphosphotyrosine 
antibodies using standard methods as described in Kamps 
30 and Selton (1988) Oncogene 2:305-315. The probed blots 
were exposed to autoradiography film overnight and 
developed using a standard laboratory processor. 
Dens it ometric measurements were carried out using an 
Ultrascan XL enhanced laser densitometer (LKB) . 
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Molecular v ight assignments v re mad relative t 
prestalned high molecular weight standards (Sigma) . The 
dose responses of protein phosphorylation and Schwann 
cell proliferation are very similar (Fig. 33). The 
5 molecular weight of the phosphorylated band is very close 
to the molecular weight of pl85* rbB2 . Similar results 
were obtained when Schwann cells were treated with 
conditioned media prepared from COS cells translates with 
the 6GF2HBS5 clone. These results correlate well with 
10 the expected interaction of the GGFs with and activation 
of IBS*** 2 . 

This experiment has been repeated with recombinant 
GGF2. Conditioned medium derived from a CHO cell line 
stably transformed with the GGF2 clone (66F2HBS5) 
15 stimulates protein tyrosine phosphorylation using the 
assay described above. Mock transf acted CHO cells fail 
to stimulate this activity. 

EXAMPLE 20 

N-alvcoBVlation of GGF 

20 The protein sequence predicted from the cDNA 

sequence of GGF- I I candidate clones GGF2BPP1,2 and 3 
contains a number of consensus N-glycosylation motifs. A 
gap in the GGFII02 peptide sequence coincides with the 
asparagine residue in one of these motifs, indicating 

25 that carbohydrate is probably bound at this site. 

N-glycosylation of the GGFs was studied by 
observing mobility changes on SDS-PAGE after incubation 
with N-glycanase, an enzyme that cleaves the covalent 
linkages between carbohydrate and aspargine residues in 

30 proteins. 

N-Glycanase treatment of GGF-II yielded a major 
band of MW 40-42 kDa and a minor band at 45-48 kDa. 
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Activity singl active d glycosylated species at ca 45- 
50 JcDa. 

Activity elution experiments with GGF-I also 
demonstrate an increase in electrophoretic mobility when 
5 treated with N-Glycanase, giving an active species of MW 
26-28 kDa. Silver staining confirmed that there is a 
mobility shift, although no N-deglycosylated band could 
be assigned because of background staining in the sample 
used. 
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(1) GENERAL INFORMATION t 

(i) APPLICANTS : Robert Sklar, Mark Marchionni, 
David I. G wynne 

(11) TITLE OF INVENTION: METHODS FOR ALTERING 

MUSCLE CONDITION 

(ill) NUMBER OF SEQUENCES: 185 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSES: Fish & Richardson 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: Massachusetts 
(F) ZIP: 02110-2804 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 5.25 inch, 360 

kb storage 

(B) COMPUTER: IBM 

(C) OPERATING SYSTEM: PC-DOS 

(D) SOFTWARE: WordPerfect 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 94/05083 A 

(B) FILING DATE: 06-MAY-94 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/209 , 204 

(B) FILING DATE: 08-MAR-94 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/059,022 

(B) FILING DATE: 06-May-93 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Clark, Paul T. 

(B) REGISTRATION NUMBER: 30,162 

(C) REFERENCE / DOCKET NUMBER: 04585/028WO1 

(iX) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 542-5070 

(B) TELEFAX: (617) 542-8906 
(B) TELEX: 200154 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRAND ED NESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Phe Lys Gly Asp Ala His Thr Glu 

1 5 



(2) INFORMATI N FOR SEQUENCE IDENTIFICATION NUMBER: 2: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 13 

(B) TYPE; amin acid 

(C) STRANDED HESS : 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 

(D) OTHER INFORMATION! Xaa in position 1 is Lysine 

or Arginine; Xaa in position 
12 is unknown. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Ala Sex Leu Ala Asp Glu Tyr Glu Tyr Met Xaa Lys 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine 

or Arginine; Xaa in 
position 10 is unknown. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Xaa Thr Glu Thr Ser Ser Ser Gly Leu Xaa Leu Lys 

1 5 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Xaa Lys Leu Gly Glu Met Trp Ala Glu 

1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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( ix ) FEATURE : 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Xaa Leu Gly Glu Lys Arg Ala 

1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginina. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Xaa He Lys Ser Glu His Ala Gly Lau Sar I la Gly Asp Thr Ala Lys 

15 10 15 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

( ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Xaa Ala Ser Leu Ala Asp Glu Tyr Glu Tyr Met Arg Lys 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginina. 

(xi) SEQUENCE DESCRIPTI N: SEQ ID NO: 8: 
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Xaa lie Lys Cly Clu His Pro Cly Lou ser lie Gly Asp Val Ala Lye 

1 5 10 15 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBERS 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDED NESS : 

(D) TOPOLOGY: linaar 

fix) FEATURE: 

(D) OTHER INFORMATION: Xaa in poaition 1 it Lysine or 

Arginine and Xaa in poaition 
12 is unknown. 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 9: 

Xaa Mat Sar Glu Tyr Ala Pha Pha Val Gin Thr Xaa Arg 

1 5 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linaar 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Xaa Sar Glu His Pro Gly Leu Ser He Gly Asp Thr Ala Lys 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginine; Xaa in position 8 is 
unknown* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Xaa Ala Gly Tyr Phe Ala Glu Xaa Ala Arg 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 12: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linaar 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysina or 

Arginina; Xaa in position 7 is 
unknown. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Xaa Lye Lau Glu Phe Lau Xaa Ala Lys 

1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linaar 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginina. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Xaa Thr Thr Glu Mat Ala Ser Glu Gin Gly Ala 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linaar 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysina or 

Arginina. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Xaa Ala Lya Glu Ala Lau Ala Ala Lau Lys 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linaar 
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(ix) FEATURE x 

(D) OTHER INFORMATI N: Xaa in position 1 is Lysine or 

Arginine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Xaa Pho Val Lou Gin Ala Lye Lys 

1 5 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Xaa Leu Gly Glu Met Trp 

1 5 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Glu Tyr Lys Cys Leu Lys Phe Lys Trp Phe Lys Lys Ala Thr Val Met 

1 5 10 15 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position S is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 18: 

Glu Ala Lys Tyr Phe Ser Lys Xaa Asp Ala 

15 10 

(2) INFORMATI N FOR SEQUENCE IDENTIFICATION NUMBER: 19: 
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(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 7 

(B) TYPE: amino acid 

(C) STRANDED NESS: 

(D) TOPOLOGY: Unttr 

(ix) FEATURE: 

(D) OTHER INFORMATION : Xaa in position 2 is unknown, 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 

Glu Xaa Lys Pha Tyr Val Pro 

1 5 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 20: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 

(B) TYPE: amino acid 

(C) STRANDED NESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Glu Leu Sar Pha Ala Sar Val Arg Lau Pro Gly Cya Pro Pro Gly Val 

1 5 10 15 

Aap Pro Mat Val Sar Pha Pro Val Ala Lau 

20 25 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 21: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2003 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N in positions 31 and 32 could 

be either A or G. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GGAATTCCTT m ' TT TTITT TTT TTTT C T T HNTTTTTTTT TGCCCTTATA CCTCTTCGCC 60 

TTTCTGTGGT TCCATCCACT TCTTCCCCCT CCTCCTCCCA TAAACAACTC TCCTACCCCT 120 

GCACCCCCAA TAAATAAATA AAAGGAGGAG GGCAAGGGGG CAGGAGGAGG AGTGGTGCTG 180 

OGAGGGGAAG GAAAAGGGAG GCAGCGCGAG AAGAGCCGGG CAGAGTCCGA ACCGACAGCC 240 

AGAAGCCCGC ACGCACCTCG CACC ATG AGA TGG CGA CGC GCC CCG CGC CGC 291 

Mat Arg Trp Arg Arg Ala Pro Arg Arg 
1 5 

TCC GGG CGT CCC GGC CCC CGG GCC CAC CGC CCC GCC TCC GCC GCC CGC 339 
Sar Gly Arg Pro Gly Pro Arg Ala Gin Arg Pro Gly Sar Ala Ala Arg 
10 15 20 25 
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TCO TCG CCO COG CTC CCG CTC CTG CCA CTA CTG CTG CTG CTG GGG ACC 387 
Ser Ser Pro Pro Leu Pro Leu Leu Pro Leu Leu Leu Leu Leu Gly Thr 
30 35 40 

GCG GCC CTG CCG CCG GGG GCG GCG GCC GGC AAC GAG GCG GCT CCC GCG 435 
Ale Ale Leu Ala Pro Gly Ale Ala Ala Gly Asn Glu Ala Ala Pro Ala 
45 50 55 

GGG GCC TCG GTG TGC TAG TCG TCC CCG CCC AGC GTG GGA TCG GTG CAG 483 
Gly Ala Ser Val Cye Tyr Ser Ser Pro Pro Ser Val Gly Ser Val Gin 

60 65 70 

GAG CTA GCT CAG CGC GCC GCG GTG GTG ATC GAG GGA AAG GTG CAC CCG 531 
Glu Leu Ala Gin Arg Ala Ala Val Val He Glu Gly Lye Val Hie Pro 
75 80 85 

CAG CGC CGG CAG CAG GGG GGA CTC GAC AGG AAC GCG GCC GCC GCG GCC 579 
Gin Arg Arg Gin Gin Gly Ala Leu Aep Arg Lye Ala Ala Ala Ala Ala 
90 95 100 105 

GGC GAG GCA GGG GCG TGG GGC GGC GAT CGC GAG CCC CCA GCC GCG GGC 627 
Gly Glu Ala Gly Ala Trp Gly Gly Aep Arg Glu Pro Pro Ala Ala Gly 
110 115 120 

CCA CGG GCG CTG GGC CCG CCC GCC GAG GAG CCC CTG CTC GCC GCC AAC 675 
Pro Arg Ala Leu Gly Pro Pro Ala Glu Glu Pro Leu Leu Ala Ala Asn 
125 130 135 

GGG ACC GTG CCC TCT TGG CCC ACC GCC CCG GTG CCC AGC GCC GGC GAG 723 
Gly Thr Val Pro Ser Trp Pro Thr Ala Pro Val Pro Ser Ala Gly Glu 
140 145 150 

CCC GGG GAG GAG GCG CCC TAT CTG GTG AAG GTG CAC CAG GTG TGG GCG 771 
Pro Gly Glu Glu Ala Pro Tyr Leu Val Lye Val Hie Gin Val Trp Ala 
155 160 165 

GTG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG CTC ACC GTG CGC CTG 819 
Val Lye Ala Gly Gly Leu Lye Lye Aep Ser Leu Leu Thr Val Arg Leu 
170 175 180 185 

GGG ACC TGG GGC CAC CCC GCC TTC CCC TCC TGC GGG AGG CTC AAG GAG 867 
Gly Thr Trp Gly Hie Pro Ala Phe Pro Ser Cye Gly Arg Leu Lye Glu 
190 195 200 

GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAC GCC AAC AGC ACC AGC 915 
Aep Ser Arg Tyr He Phe Phe Met Glu Pro Aep Ala Asn Ser Thr Ser 
205 210 215 

CGC GCG CCG GCC GCC TTC CGA GCC TCT TTC CCC CCT CTG GAG ACG GGC 963 
Arg Ala Pro Ala Ala Phe Arg Ala Ser Phe Pro Pro Leu Glu Thr Gly 
220 225 230 

CGG AAC CTC AAG AAG GAG GTC AGC CGG GTG CTG TGC AAG CGG TGC GCC 1011 
Arg Asn Leu Lye Lys Glu Val Ser Arg Val Leu Cys Lys Arg Cys Ala 
235 240 245 

TTG CCT CCC CAA TTG AAA GAG ATG AAA AGC CAG GAA TCG GCT GCA GGT 1059 
Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser Ala Ala Gly 
250 255 260 265 

TCC AAA CTA CTC CTT CGG TGT GAA ACC ACT TCT GAA TAC TCC TCT CTC 1107 
Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
270 175 180 

AGA TTC AAG TGG TTC AAG AAT GGG AAT GAA TTG AAT CGA AAA AAC AAA 1155 
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys Asn Lya 

185 190 195 
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CCA CAA AAT ATC AAC ATA CAA AAA AAG CCA GGG AAG TCA GAA CTT CCC X203 
Pro Gin Asn lie Lys lie Gin Lye Lya Pr Gly Lya Sar Glu Leu Arg 

200 205 210 

ATT AAC AAA GCA TCA CTO GCT GAT TCT GGA GAG TAT ATC TGC AAA GTG 1251 
lie Aan Lys Ala Sar Leu Ala Asp Sar Gly Glu Tyr Mat Cys Lys Val 

215 220 225 

ATC AGC AAA TTA GGA AAT GAC ACT GCC TCT GCC AAT ATC ACC ATC GTG 1299 
I la Sar Lya Lau Gly Aan Aap Smr Ala Sar Ala Aan I la Thr I la Val 
230 235 240 245 

GAA TCA AAC GCT ACA TCT ACA TCC ACC ACT GGG ACA AGC CAT CTT GTA 1347 
Glu Sar Aan Ala Thr Sar Thr Sar Thr Thr Gly Thr Sar Hia Lau Val 
250 255 260 

AAA TGT GCG GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGG GAG TGC 1395 
Lya Cya Ala Glu Lya Glu Lya Thr Pha Cys Val Aan Gly Gly Glu Cys 
265 270 275 

TTC ATG GTG AAA GAC CTT TCA AAC CCC TCG AGA TAC TTC TGC AAG TGC 1443 
Pha Mat Val Lys Aap Lau Sar Asn Pro Sar Arg Tyr Lau Cya Lya Cys 
280 285 290 

CCA AAT GAG TTT ACT GGT GAT CGC TCC CAA AAC TAC GTA ATG GCC AGC 1491 
Pro Asn Glu Pha Thr Gly Asp Arg Cys Gin Asn Tyr Val Mat Ala Sar 
295 300 305 

TTC TAC ACT ACG TCC ACT CCC TTT CTG TCT CTC CCT GAA 1530 
Pha Tyr Sar Thr Sar Thr Pro Pha Lau Sar Lau Pro Glu 
400 405 410 



TACGAGCATG 


CTCAGTTGGT 


GCTGCTTTCT 


TGTTGCTGCA 


TCTCCCCTCA 


GATTCCACCT 


1590 


AGAGCTAGAT 


GTGTCTTACC 


AGATCTAATA 


TTGACTGCCT 


CTGCCTGTCG 


CATGAGAACA 


1650 


TTAACAAAAG 


CAATTGTATT 


ACTTCCTCTG 


TTCGCGACTA 


GTTGGCTCTG 


AGATACTAAT 


1710 


AGGTGTGTGA 


GGCTCCGGAT 


GTTTCTGGAA 


TTGATATTGA 


ATGATGTGAT 


ACAAATTGAT 


1770 


AGTCAATATC 


AAGCAGTGAA 


ATATGATAAT 


AAAGGCATTT 


CAAAGTCTCA 


CTTTTATTGA 


1830 


TAAAATAAAA 


ATCATTCTAC 


TGAACAGTCC 


ATCTTCTTTA 


TACAATGACC 


ACATCCTGAA 


1890 


AAGGGTGTTG 


CTAAGCTGTA 


ACCGATATGC 


ACTTGAAATG 


ATGGTAAGTT 


AATTTTGATT 


1950 


CAGAATGTGT 


TATTTGTCAC 


AAATAAACAT 


AATAAAAGGA 


AAAAAAAAAA 


AAA 


2003 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 22: 
(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 12 

(B) TYPE: amino acid 

(C) STRANDED NESS: 

(0) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 11 is unknown* 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Ala Sar Lau Ala Asp Glu Tyr Glu Tyr Met Xaa Lys 
15 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 23: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 9 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Thr Glu Thr Ser Ser Ser Gly Leu Xaa Leu Lya 

15 io 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Ala Ser Leu Ala Asp Glu Tyr Glu Tyr Met Arg Lys 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 
<C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 7 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Ala Gly Tyr Phe Ala Glu Xaa Ala Arg 

15 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Thr Thr Glu Met Ala Ser Glu Gin Gly Ala 

1 5 10 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANOEDNESS: 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Ala Lys Glu Ala Leu Ala Ala Lau Lya 

1 S 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Pha Val Lau Gin Ala Lya Lya 

1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Glu Thr Gin Pro Aap Pro Gly Gin lie Leu Lye Lys Val Pro Met Val 

15 10 15 

lie Gly Ala Tyr Thr 
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 

(D) OTHER INFORMATION: Xaa in positions 1, 3, 17 and 

19 is unknown. 

(xi) SEQUENCE DESCRIPTI N: SEQ ID N : 30: 

Xaa Glu Xaa Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys Glu 

15 10 15 
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Xaa ly Xaa Gly Lys 

20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 31: 
(i) SEQUENCE CHARACTERISTICS j 

(A) LENGTH: 13 

(B) TYPE: amino acid 
<C) STRANDEDNBSS x 

(D) TOPOLOGY: linaar 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 31: 

Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 32: 
(1) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDSDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION : Xaa in position 6 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Lys Leu Glu Phe Leu Xaa Ala Lys 

1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNBSS: 

(D) TOPOLOGY: linear 

( ix) FEATURES 

(D) OTHER INFORMATION: Xaa in position 1 is Lysina or 

Arginine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Xaa Val His Gin Val Trp Ala Ala Lys 

1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 34: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : 

(D) TOPOLOGY: linear 
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(ix) FEATURES 

(D) OTHER INFORMATI N: Xaa in po ait ion 1 la Lysine or 

Arginine, Xaa in position 11 
is unknown. 

(xi) SEQUENCE DESCRIPTION x SEQ ID NO: 34: 

Xaa Tyr I la Phe Phe Hat Glu Pro Glu Ala Xaa Ser Ser Gly 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginine, Xaa in position 13 
is unknown. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Xaa Leu Gly Ala Trp Gly Pro Pro Ala Phe Pro Val Xaa Tyr 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginine* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Xaa Trp Phe Val Val lie Glu Gly Lys 

1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in positi n 1 is Lysine or 

Arginine* 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Xaa Ala Ser Pr Val Ser Val Cly Ser Val Cln Clu L u Val Gin Arg 

1 5 10 15 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 38: 
(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDED NESS : 

(D) TOPOLOGY: linear 

(ix> FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or 

Arginine* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Xaa Val Cys Leu Leu Thr Val Ala Ala Leu Pro Pro Thr 

1 5 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

<ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in poaition 1 is Lysine or 

Arginine; Xaa in position 6 is 
unknown. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Xaa Asp Leu Leu Leu Xaa Val 

1 5 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Cys Thr Cys Gly Cys Cys Lys Cys Cya Arg Thr Thr Cys Ala Cya Arg 

15 10 15 

Cys Ala Gly Ala Ala Gly Gly Thr Cys Thr Thr Cys Thr Cys Cys Thr 

20 25 30 

Thr Cys Thr Cys Ala Gly Cys 

35 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATI N NUMBER I 41 : 
(i) SEQUENCE CHARACTERISTICS I 

(A) LENGTH: 24 

(B) TYPE: amino acid 

(C) STRAND ED NESS t 

(D) TOPOLOGY* linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 41: 

Cya Cya Thr Cya Gly Cya Thr Cya Cya Thr Thr Cya Thr Thr Cya Thr 

15 10 15 

Thr Gly Cya Cya Cya Thr Thr Cya 
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 42: 
(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDED NESS : 

(D) TOPOLOGY: linaar 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Val Hia Gin Val Trp Ala Ala Lya 

1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 43: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 
<C) STRANDED NESS s 

(D) TOPOLOGY: linaar 

<ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in poaition 10 ia unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Tyr II* Phe Phe Mot Glu Pro Glu Ala Xaa Ser Smr Gly 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 44: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linaar 

(lx) FEATURE: 

(D) OTHER INFORMATION: Xaa in poaition 12 ia unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
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Leu Gly Ala Trp Gly Pro Pro Ala Phe Pro Val Xaa Tyr 

15 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 45s 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 8 

(B) TYPE: amino acid 

(C) STRANDEDNESSs 

(0) TOPOLOGY: linaar 

(xi> SEQUENCE DESCRIPTION : SEQ ID NO: 45: 

Trp Phe Val val I la Glu Gly Lya 

15 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 

(B) TYPE: amino acid 

(C) STRANDED NESS : 

(D) TOPOLOGY: linaar 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Ala Sar Pro Val Sar Val Gly Sar Val Gin Glu Leu Val Gin Arg 

15 10 15 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 47: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

<D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Val Cys Leu Leu Thr Val Ala Ala Leu Pro Pro Thr 

1 5 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 48: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Lya Val Hie Gin Val Trp Ala Ala Lya 

1 5 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 49: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 12 is unknown, 
(xi) SEQUENCE DESCRIPTION x SEQ ID NO: 49: 

Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Xaa Lys 

1 5 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 50: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 

(B) TYPE: amino acid 

(C) STRANDED NESS : 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 5 is 

unknown. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Asp Leu Leu Leu Xaa Val 

1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 51: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
TTYAARGGNG AYGCNCAYAC 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
CATRTAYTCR TAYTCRTCNG C 



(2) INFORMATI N FOR SEQUENCE IDENTIFICATION NUMBER: 53: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 
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(B) TYPSs nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
TCYTCNGANC CCATYTCNGT 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
TGYTGRCTNG CCATYTCNGT 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 55: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CCDATNACCA TNGGNACYTT 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 56: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GCNGCCCANA CYTGRTGNAC 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 57: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
GCYTGNGGYT CCATRAARAA 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 58: 
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(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTHS 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY t linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58s 
CCYTCDATNA CNACRAACCA 20 

<2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBERS 59s 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
TCNGGRAART ANCGNGC 17 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 60: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 20 

(B) TYPBs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
GCNGCNAGNG CYTCYTTNGC 20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 61: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GCNGCYAANG CYTCYTTNGC 20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 62: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 20 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
TTYTTNGCYT CNAGNACRAA 20 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 63 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
TTYTTNGCYT GYAANACRAA 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 64 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
TGNACNAGYT CYTGNAC 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 65: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
TGNACYAAYT CYTGNAC 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 66: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CATRTAYTCN CCNGARTCNG C 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 67: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N : 67: 
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CATRTAYTCN CGRCTRTCNG C 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 68: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
NGARTCNGCY AANGANGCYT T 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 69: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

NGARTCNGCN AGNGANGCYT T 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 70: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
RCTRTCNGCY AANGANGCYT T 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 71: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sing la 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
RCTRTCNGCN AGNGANGCYT T 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 72: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
NGARTCNGCY AARCTNGCYT T 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 73: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
NGARTCNGCN AGRCTNGCYT T 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 74: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
RCTRTCNGCY AARCTNGCYT T 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 75: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
RCTRCTNGCN AGRCTNGCYT T 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 76: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
ACNACNGARA TGGCTCNNGA 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 77: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 
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<C) STRANDEDNBSS i single 
(D) TOPOLOGY: linear 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

ACNACNGARA TGGCAGYNGA 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 78: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNSSS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CAYCARGTNT GGGCNGCNAA 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 79: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNSSS: single 

(D) TOPOLOGY: linear 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
TTYGTNGTNA THGARGGNAA 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 80: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
AARGGNGAYG CNCAYACNGA 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 81: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
GARGCNYTNG CNGCNYTNAA 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 82: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 82: 

GTNGGNTCNG TNCARGARYT 20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 83: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTB: 20 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GTNGGNAGYG TNCARGARYT 20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 84: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
NACYTTYTTN ARDATYTGNC C 21 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 85: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 

(D) OTHER INFORMATION: Xaa in positions 14, 23, 

90, 100, 126, and 13S is a 
stop codon. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

TCTAA AAC TAC AGA GAC TGT ATT TTC ATG ATC ATC ATA GTT CTG TGA AAT ATA 53 
Aen Tyr Arg Asp Cys lie Phe Met He He He Val Leu Xaa Asn He 

15 10 15 

CTT AAA CCG CTT TGG TCC TGA TCT TGT AGG AAG TCA GAA CTT CGC ATT 101 
Leu Lys Pro Leu Trp Ser Xaa Ser Cys Arg Lys Ser Glu Leu Arg He 
20 25 30 

AGC AAA GCG TCA CTG GCT CAT TCT GGA GAA TAT ATG TGC AAA GTG ATC 149 
Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Ser Met Cys Lys Val He 
35 40 45 
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AGC AAA CTA GCA AAT GAC ACT GCC TCT GCC AAC ATC ACC ATT CTG GAG 197 
Ser Lya Leu Gly Aan Aap Sar Ala Ser Ala Aan lie Arg I la Val Glu 
50 55 60 

TCA AAC GGT AAC AGA TGC CTA CTG CGT GCT ATT TCT CAC TCT CTA AGA 245 
Ser Aan Gly Lya Arg Cya Lau Leu Arg Ala lie Sar Gin Sar Leu Arg 

65 70 75 80 

GGA GTG ATC AAG GTA TCT GGT CAC ACT TGA ATC ACG GAG CTG TGT GAA 293 
Gly Val Ila Lya Val Cya Gly Hia Thr Xaa Ila Thr Gin Val Cya Glu 
85 90 95 

ATC TCA TTG TGA ACA AAT AAA AAT CAT GAA AGG AAA ACT CTA TGT TTC 341 
Ila Sar Cya Xaa Thr Aan Lya Asn Hia Glu Arg Lya Thr Lau Cya Lau 

100 105 110 

AAA TAT CTT ATG GGT CCT CCT GTA AAG CTC TTC ACT CCA TAA GGT GAA 389 
Lya Tyr Lau Hat Gly Pro Pro Val Lya Lau Pha Thr Pro Xaa Gly Glu 
115 120 125 

ATA GAC CTG AAA TAT ATA TAG ATT ATT T 417 
lie Aap Leu Lya Tyr lie Xaa Ila lie 

130 135 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 86: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

<ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 19, 25, and 

31 is Inosine. Y can be 
cytidlne or thymidine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

CCGAATTCTG CAGGARACNC ARCCNGAYCC NGG 33 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 87: 
(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 37 

(B) TYPE* nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 14, 20, 23, 

29, and 35 is Inosine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

AACGATCCTG CAGNGTRTAN GCNCCDATNA CCATNGG 37 

<2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 88: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTHS 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS i single 

(D) TOPOLOGY: linear 

( ix ) FEATURE: 

(D) OTHER INFORMATION: N at petitions 16, 21, and 

24 is Inosine. Y can be 
cytidine or thymidine. 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 88: 

CCGAATTCTG CAGGCNCAYT CNGCNGARTA YATG 34 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 89: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 16 and 25 is 

Inosine. Y can be cytidine 
or thymidine. 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

CCGAATTCTG CAGCCNGAYA GYGGNGARTA YAT 33 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 90: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

( ix ) FEATURE : 

(D) OTHER INFORMATION: N at positions 14, 15, 16, 

26, and 29 is Inosine. Y 
can be cytidine or 
thymidine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
AAGGATCCTG CAGNNNCATR TAYTCNCCNG ARTC 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 91: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 
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(D) OTHER INFORMATION : N at positions 14, 15, 16, 

and 26 is I no sine. Y can 
be cytidine r thymidine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

AAGCATCCTG CAGNNNCATR TAYTCNCCRC TRTC 34 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 21, 28, and 

31 is Inosine. Y can be 
cytidine or thymidine. 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 92: 

CCGAATTCTG CAGCAYCARG TNTGGGCNGC NAA 33 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 31 is 

Inosine. Y can be cytidine 
or thymidine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

CCGAATTCTG CAGATHTTYT TYATGGARCC NGARG 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 94: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 

(D) OTHER INFORMATION: N at positions IB, 21, 24, 

27, and 33 is Inosine. Y 
can be cytidine or 
thymidine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
CCGAATTCTG CAGGGCGNCC NCCNGCNTTY CCNGT 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 95: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 21 and 24 is 

Inosine. Y can be cytidine 
or thymidine. 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

CCGAATTCTG CAGTGGTTYG TNGTNATHGA RGG 33 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 96: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 17, 20, and 

26 is Inosine. Y can be 
cytidine or thymidine. 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 96 

AAGGATCCTG CAGYTTNGCN GCCCANACYT GRTG 34 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 97: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 19 is 

Inosine. Y can be cytidine 
or thymidine. 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

AAGGATCCTG CAGGCYTCNG GYTCCATRAA RAA 33 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 98: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

( ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 16, 22, 25, 

28, and 31 is I no sins* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
AAGGATCCTG CACACNGCRA ANGCNGGNCG NCC 33 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 99: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE* nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 

(D) OTHER INFORMATION: N at positions 17, 26, and 

29 is Inosine. Y can be 
cytidine or thymidine* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

AAGGATCCTG CAGYTTNCCY TCDATNACNA CRAAC 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 100: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 

(D) OTHER INFORMATION: N at position 18 is 

Inosine. Y can be cytidine 
or thymidine* 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

CATRTAYTCR TAYTCTCNGC AAGGATCCTG CAG 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 101: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 19, 25, and 

31 is Inosine* Y can be 
cytidine or thymidine. 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
CCCAATTCTC CAGAARGGNG AYGCNCAYAC NGA 33 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 102: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 3 and 18 is 

Inosine. Y can be cytidine 
or thymidine. 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

GCNGCYAANG CYTCYTTNGC AAGGATCCTG CAC 33 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 103: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 3, 6, 9, and 

18 is Inosine. Y can be 
cytidine or thymidine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

GGNGCNAGNG CYTCYTTNGC AAGGATCCTG CAG 33 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 104: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 3, 12, and 15 

is Inosine. Y can be 
cytidine or thymidine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

TCNGCRAART ANCCNCCAAG GATCCTCCAG 30 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 105: 
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(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTHS 38 

(B) TYPEs nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY s linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 



CATCGATCTG CAGGCTGATT CTGGAGAATA TATGTGCA 



36 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 106: 
(i) SEQUENCE CHARACTERISTICS: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
AACGATCCTG CAGCCACATC TCGAGTCGAC ATCGATT 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 107: 
(i) SEQUENCE CHARACTERISTICS: 



<C) STRAND EDNESS : single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

CCGAATTCTG CAGTGATCAG CAAACTAGGA AATGACA 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 108: 
(i) SEQUENCE CHARACTERISTICS: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
CATCGATCTG CAGCCTAGTT TGCTGATCAC TTTGCAC 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 109: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 37 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(A) LENGTH: 

(B) TYPE: 



37 

nucleic acid 



(A) LENGTH: 37 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(A) LENGTH: 37 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 



AAGGATCCTG CAGTATATTC TCCAGAATCA GCCAGTG 



37 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 110: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 110: 
AAGGATCCTG GAGCGACGGA GTAGGCATCT CTTA 34 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 111: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
CCGAATTCTG CAGCAGAACT TCGCATTAGC AAAGC 35 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 112: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
CATCCCGGGA TGAAGAGTCA GGAGTCTGTG GCA 33 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 113: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 

(B) TYPEs . nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
ATACCCGGGC TGCAGACAAT GAGATTTCAC ACACCTGCG 39 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 114: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
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AAGCATCCTG CAGTTTGGAA CCTCCCACAC ACTCCT 36 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 115: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
ATACCCGGGC TGCAGATGAG ATTTCACACA CCTGCGTGA 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 116: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 
<C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

His Gin Val Trp Ala Ala Lya Ala Ala Gly Lau Lya 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 117: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linaar 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Gly Gly Lau Lya Lya Asp Ser Lau Leu Thr Val Arg Leu Gly Ala Aan 
1 5 10 15 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 118: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 12 is 

unknown . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Leu Gly Ala Trp Gly Pro Pro Ala Phe Pro Val Xaa Tyr 

15 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 119: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE i amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Lau Lau Thr Val Arg Lau Gly Ala Trp Gly Hia Pro Ala Pha Pro Sar 

15 10 15 

Cya Gly Arg Lau Lya Glu Aap 

20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 120: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linaar 

<ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 10 ia 

unknown. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Tyr I la Phe Pha Mat Glu Pro Glu Ala Xaa Ser Ser Gly 

15 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 121: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linaar 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Lya Glu Aap Sor Arg Tyr Ila Pha Pha Mat Glu Pro Glu Ala Aan Sar 

1 5 10 15 

Sar Gly Gly Pro Gly Arg Lau 
20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 122: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linaar 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

Val Ala Gly Sar Lya Lau Val Lau Arg Cya Glu Thr Sar Sar 

15 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 123: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Glu Tyr Lys Cys Lou Lys Phe Lys Trp Phe Lye Lye Ala Thr Val Met 

15 10 IS 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 124: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 26 

<B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Cya Glu Thr Ser Ser Glu Tyr Ser Sex Lou Lye Phe Lye Trp Phe Lys 
15 10 15 

Aan Gly Ser Glu Leu Ser Arg Lye Asn Lya 

20 25 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 125: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

( ix ) FEATURE : 

(D) OTHER INFORMATION: Xaa in position 12 is unknown, 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

Lye Ala Ser Leu Ala Aap Ser Gly Glu Tyr Met Xaa Lys 

1 5 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 126: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 126: 

Glu Leu Arg lie Ser Lye Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met 

1'5 10 15 

Cys Lys Val lie Ser Lys Leu 

20 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 127: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEONESS: 

(D) TOPOLOGY: lino ax 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 127: 

Ala Ser Lam Ala Asp Glu Tyr Glu Tyr Mot Arg Lya 

15 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 128: 
(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 22 

(B) TYPE: amino acid 

(C) STRANDEONESS: 

(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Leu Arg I la Ser Lya Ala Sex Leu Ala Asp Ser Gly Glu Tyr Met Cys 

1 5 10 is 

Lys Val lie Ser Lys Leu 

20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 129: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

CCTGCAG CAT CAA GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCC CTG 55 
His Gin Val Trp Ala Ala Lys Ala Gly Gly Leu Lys Lys Asp Ser Leu 

15 10 15 

CTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC TGC 103 
Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser Cys 
20 25 30 

GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAG 151 
Gly Arg Leu Lys Glu Asp Ser Arg Tyr He Phe Phe Met Glu Pro Glu 
35 40 45 

GCC AAC AGC AGC GCC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC CCC 199 
Ala Asn Ser Ser Gly Gly Pro Gly Arg Leu Pro Ser Leu Leu Pro Pro 
50 55 60 

TCT CCA GAC GGG CCG GAA CCT CAA GAA GGA GGT GAG CCC GGT GCT GTG 247 
Ser Arg Asp Gly Pro Glu Pro Gin Glu Gly Gly Gin Pro Gly Ala Val 
65 70 75 80 

CAA CGG TGC GCC TTG CCT CCC CGC TT AAA GAG ATG AAG AGT GAG GAG 295 
Gin Arg Cys Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 
85 90 95 
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TCT GTG GCA GOT TCC AAA CTA GTG CTT CGG TGC GAG ACC AGT TCT GAA 343 
Ser Val Ala ly Ser Lye Leu Val Lou Arg Cya Glu Thr Ser Ser GXu 

100 105 110 

TAG TCC TCT CTC AAG TTC AAG TGG TTC AAG AAT GGG AGT GAA TTA AGC 391 
Tyr Ser Smr Leu Lye Phe Lye Trp Phe Lye Aan Gly Ser Glu Leu Smr 
115 120 125 

CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGG CCG GGG AAG 439 
Arg Lya Aan Ly* Pro Glu Aan lie Lya lie Gin Ly* Arg Pro Gly Lye 
130 135 140 

TCA GAA CTT CGC ATT AGC AAA GCG TCA CTC GCT GAT TCT GGA GAA TAT 487 
Ser Glu Lau Arg II* Ser Lye Ala Smr Leu Ala Asp Ser Gly Glu Tyr 
145 150 155 160 

ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC 535 
Mat Cya Lya Val I la Sar Lye Lau Gly Aan Asp Sar Ala Sar Ala Aan 

165 170 175 

ATC ACC ATT GTG CAG TCA AAC GGT AAG AGA TGC CTA CTC CCT GCT ATT 583 
I la Thr Ila Val Glu Sar Aan Gly Lya Arg Cys Lau Lau Arg Ala lie 
180 185 190 

TCT CAG TCT CTA AGA GGA GTG ATC AAG GTA TCT GGT CAC ACT 625 
Sar Gin Sar Lau Arg Gly Val Ila Lya Val Cya Gly Hia Thr 
195 200 205 

TGAATCACGC AGGTGTGTGA AATCTCATTG TGAACAAATA AAAATCATGA AAGGAAAAAA 685 

AAAAAAAAAA AATCGATGTC GACTCGAGAT CTGGCTGCAG GTCGACTCTA GAGGATCCC 744 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 130: 
(i) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 1193 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CCTCCAG CAT CAA GTG TGG GCG GCG AAA GCC GGG GGC TTC AAG AAG GAC TCG CTC 
Hia Gin Val Trp Ala Ala Lya Ala Gly Gly Leu Lya Lya Aap Sar Leu 

15 10 15 

CTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC TGC 1 
Leu Thr Val Arg Leu Gly Ala Trp Gly Hia Pro Ala Phe Pro Ser Cys 
20 25 30 

GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAG 1 
Gly Arg Leu Lya Glu Aap Ser Arg Tyr lie Phe Phe Met Glu Pro Glu 
35 40 45 

GCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC CCC 1 
Ala Lya Ser Ser Gly Gly Pro Gly Arg Leu Pro ser Leu Leu Pro Pro 
50 55 60 

TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GGT GCT GTG 2 
Ser Arg Aap Gly Pro Glu Pro Gin Glu Gly Gly Gin Pro Gly Ala Val 
65 70 75 80 
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CAA CCC TGC GCC TTG CCT CCC CGC TTG AAA GAG ATG AAG AGT CAG GAG 295 
Gin Arg Cys Ala Lau Pro Pro Arg Lau Lya Glu Mat Lya Sar Gin Clu 
85 90 95 

TCT GTG GCA GGT TCC AAA CTA GTG CTT CGG TGC GAG ACC AGT TCT GAA 343 
Sar Val Ala Gly Sar Lya Lau val Lau Arg Cya Glu Thr Sar Sar Glu 

100 105 110 

TAC TCC TCT CTC AAG TTC AAG TGG TTC AAG AAT GGG AGT GAA TTA AGC 391 
Tyr Sar Sar Lau Lya Pha Lya Trp Pha Lya Aan Gly Sar Glu Lau Sar 

115 120 125 

CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGG CCG GGG AAG 439 
Arg Lya Aan Lya Gly Gly Aan I la Lya Ila Gin Lya Arg Pro Gly Lya 

130 135 140 

TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA TAT 487 
Sar Glu Lau Arg Ila Sar Lya Ala Sar Lau Ala Aap Sar Gly Glu Tyr 

145 150 155 160 

ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC 535 
Mat Cya Lya Val Ila Sar Lya Lau Gly Aan Aap Sar Ala Sar Ala Aan 

165 170 175 

ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG ACA 583 
Ila Thr Ila Val Glu Sar Aan Ala Thr Sar Thr Sar Thr Ala Gly Thr 
180 185 190 

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 631 
Sar His Lau Val Lya Sar Ala Glu Lya Glu Lya Thr Pha Cya Val Aan 

195 200 205 

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 679 
, Gly Gly Glu Cya Pha Mat Val Lya Aap Lau Sar Aan Pro Sar Arg Tyr 

210 215 220 

TTC TCC AAG TCC CAA CCT GGA TTC ACT GGA GCG ACA TGT ACT GAG AAT 727 
Lau Cya Lya Cya Gin Pro Gly Pha Thr Gly Ala Arg Cya Thr Glu Aan 
225 230 235 240 

GTG CCC ATG AAA GTC CAA ACC CAA GAA AGT GCC CAA ATG AGT TTA CTG 775 
Val Pro Met Lya Val Gin Thr Gin Glu Sar Ala Gin Mat Sar Lau Lau 
245 250 255 

GTG ATC GCT GCC AAA ACT ACG TAATGGCCAG CTTCTACAGT ACGTCCACTC 826 
Val Ila Ala Ala Lya Thr Thr 
260 



CCTTTCTGTC 


TCTGCCTGAA 


TAGCGCATCT 


CAGTCGGTGC 


CGCTTTCTTG 


TTGCCGCATC 


886 


TCCCCTCAGA 


TTCCTCCTAG 


AGCTAGATGC 


GTTTTACCAG 


GTCTAACATT 


GACTGCCTCT 


946 


GCCTGTCGCA 


TGAGAACATT 


AACACAAGCG 


ATTGTATGAC 


TTCCTCTGTC 


CGTGACTAGT 


1006 


GGGCTCTGAG 


CTACTCGTAG 


GTGCGTAAGG 


CTCCAGTGTT 


TCTGAAATTG 


ATCTTGAATT 


1066 


ACTGTGATAC 


GACATGATAG 


TCCCTCTCAC 


CCAGTGCAAT 


GACAATAAAG 


GCCTTGAAAA 


1126 


GTCAAAAAAA 


AAAAAAAAAA 


AAAAAATCGA 


TGTCGACTCG 


ACATGTGGCT 


CCAGGTCCAC 


1186 


TCTAGAG 












1193 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 131: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1108 

(B) TYPE i nucleic acid 

(C) STRANDEDNESS : a ingle 

(D) TOPOLOGY i linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



CCTCCAG CAT CAA GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG 55 

His Gin Val Trp Ala Ala Lye Ala Gly Gly Lou Lye Lye Aap Ser Lau 

15 10 15 

CTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC TGC 103 

Lau Thr Val Arg Lau Gly Ala Trp Gly Hia Pro Ala Phe Pro Ser Cya 
20 25 30 

GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAG 151 

Gly Arg Leu Lya Glu Aap Sar Arg Tyr I la Phe Pha Mat Glu Pro Glu 

35 40 45 



GCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC CCC 199 
Ala Aan Sar Sar Gly Gly Pro Gly Arg Lau Pro Ser Lau Lau Pro Pro 
50 55 60 

TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GCT GCT CTC 247 
Ser Arg Aap Gly Pro Glu Pro Gin Glu Gly Gly Gin Pro Gly Ala Val 
65 70 75 80 

CAA CGG TGC GCC TTG CCT CCC CGC TTG AAA GAG ATG AAG ACT CAG GAG 295 
Gin Arg Cya Ala Leu Pro Pro Arg Leu Lya Glu Met Lya Ser Gin Glu 
85 90 95 

TCT GTG CCA GGT TCC AAA CTA GTG CTT CGG TGC GAG ACC ACT TCT GAA 343 
Ser Val Ala Gly Ser Lya Leu Val Leu Arg Cya Glu Thr Ser Ser Glu 
100 105 110 

TAC TCC TCT CTC AAG TTC AAG TGG TTC AAG AAT GGG ACT GAA TTA AGC 391 
Tyr Ser Ser Leu Lya Phe Lya Trp Phe Lya Aan Gly Ser Glu Leu Ser 
115 120 125 

CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGG CCG GGG AAG 439 
Arg Lya Aan Lya Pro Glu Aan lie Lya lie Gin Lya Arg Pro Pro Lya 

130 135 140 

TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA TAT 487 
Ser Glu Leu Arg He Ser Lya Ala Ser Leu Ala Aap Ser Gly Glu Tyr 
145 150 155 160 

ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC 535 
Met Cya Lya Val He Ser Lya Leu Gly Aan Aap Ser Ala Ser Ala Aan 

165 170 175 

ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG ACA 583 
He Arg He Val Glu Ser Aan Ala Thr Ser Thr Sar Thr Ala Gly Thr 
180 185 190 

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 631 
Sar Hia Leu Val Lya Cya Ala Glu Lya Glu Lya Thr Phe Cya Val Aan 
195 200 205 

GGA GCC GAG TGC TTC ATG GTC AAA CAC CTT TCA AAT CCC TCA AGA TAC 679 
Gly Gly Glu Cya Phe Met Val Lya Aap Leu Ser Aan Pro Ser Arg Tyr 
210 215 220 

TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC 727 
Leu Cya Lya Cya Pro Aan Glu Pha Thr Gly Aap Arg Cya Gin Aan Tyr 
225 230 235 240 
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GTA ATG CCC AGC TTC TAC AGT ACG TCC ACT CCC TTT CTC TCT CTG CCT 
Val Met AX* Sar Phe Tyr Sar Thr Sar Thr Pro Phe Leu sar Lau Pr 

245 250 255 

GAA TAGCGCATCT CAOTCGGTGC CGCTTTCTTG TTGCCGCATC TCCCCTCACA TTCCCCCTAG 
Glu 

ACCTAGATGC GTTTTACCAC GTCTAACATT GACTGCCTCT CCCTGTCGCA TGAGAACATT 
AACACAAGCG ATTGTATGAC TTCCTCTGTC OGTGACTAGT GGGCTCTGAG CTACTCGTAG 
GTGCGTAAGG CTCCAGTGTT TCTGAAATTG ATCTTGAATT ACTCTGATAC GACATGATAC 
TCCCTCTCAC CCAGTGCAAT GACAATAAAG GCCTTGAAAA GTCAAAAAAA AAAAAAAAAA 
AAAAATCGAT GTCCACTCGA GATGTGGCTG 



775 

838 

898 

958 
1018 
1078 
1108 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 559 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(O) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N in position 214 is 

unknown . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

AGTTTCCCCC CCCAACTTGT CGGAACTCTG GGCTCGCCCG CAGGGCAGGA GCGGAGCGGC 60 

GGCGGCTGCC CAGGCGATGC GAGCGCGGGC CGGACGCTAA TCGCCTCTCC CTCCTCCGGC 120 

TGCGAGCGCG CCGGACCGAG GCAGCGACAG GAGCCGACCG CGGCGGGAAC CGAGGACTCC 180 

CCAGCGGCGC CCCAGCAGGA GCCACCCCCC GAGNCGTGCG ACCCGGACGG AGCGCCCGCC 240 

AGTCCCAGGT GGCCCGGACC GCACGTTGCG TCCCCGCGCT CCCCGCCGGC GACAGGAGAC 300 

GCTCCCCCCC ACGCCGCGCG CGCCTCGCCC CGGTCGCTGG CCCCCCTCCA CTCCGGGGAC 360 

AAACTTTTCC CGAAGCCGAT CCCAGCCCTC GGACCCAAAC TTGTCGCGCG TCGCCTTCGC 420 

CGGGAGCCGT CCGCGCAGAG CGTGCACTTC TCGGGCGAG ATG TCG GAG CGC AGA 474 

Mat Sar Glu Arg Arg 
X 5 

GAA GGC AAA GGC AAG GGG AAG GGC GGC AAG AAG GAC CGA GGC TCC GGG 522 
Glu Gly Lys Giy Lys Gly Lya Gly Gly Lys Lys Asp Arg Gly Sar Gly 
10 15 20 

AAG AAG CCC GTG CCC CCC GCT GGC GGC CCG AGC CCA G 559 
Lys Lys Pro Val Pro Ala Ala Gly Gly Pro Sar Pro Ala 
25 30 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 133: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 



252 

nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ix) FEATURE: 

(0) OTHER INFORMATION: N in position 8 could be 

either A or G. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

CC CAT CAN GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAC GAC TCG 47 
His Gin Vel Trp Ala Ala Lys Ala Gly Gly Leu Lye Lys Asp Ser 

15 10 15 

CTG CTC ACC GTG CCC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC 95 
Leu Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser 
20 25 30 

TCC GGG CCC CTC AAG GAG GAC AGC AGO TAC ATC TTC TTC ATG GAG CCC 143 
Cys Gly Arg Leu Lys Glu Asp ser Arg Tyr He Phe Phe Met Glu Pro 
35 40 45 

GAG GCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC 191 
Glu Ala Asn Ser Ser Gly Gly Pro Gly Arg Leu Pro ser Leu Leu Pro 
SO 55 60 

CCC TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GGT GCT 239 
Pro Ser Arg Asp Gly Pro Glu Pro Gin Glu Gly Gly Gin Pro Gly Ala 
65 70 75 

GTG CAA CGG TGC G 252 
Val Gin Arg Cys 
80 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 134: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENG TH: 178 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 134: 

CCT TGC CTC CCC GCT TGA AAG AGA TGA AGA GTC AGG AGT CTG TGG CAG 46 
Leu Pro Pro Arg Leu Lys Glu His Lys Ser Gin Glu Ser Val Ala Gly 
15 10 15 

GTT CCA AAC TAG TGC TTC GGT GCG AGA CCA GTT CTG AAT ACT CCT CTC 96 
Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr ser Ser Leu 
20 25 30 

TGA AGT TCA AGT GGT TGA AGA ATG GGA GTG AAT TAA GCC GAA AGA ACA 144 
Lys Phe Lys Trp Phe Lye Asn Gly Ser Glu Leu Ser Arg Lys Asn Lys 
35 40 45 

AAC CAC AAA ACA TCA AGA TAC AGA AAA GGC CGG G 178 
Pro Gly Asn He Lye He Gin Lys Arg Pro Gly 
50 55 

(2) INFORMATION FOR SEQUENCE IDENTIFICATI N NUMBER: 135: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTHS 122 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 135: 

G AAG TCA GAA CTT CCC ATT AGC AAA GCG TCA CTG OCT GAT TCT GGA 46 
Lys Ser Glu Leu Arg lie Ser Lys Ala Ser Leu Ala Asp Sor Gly 

* 5 10 15 

GAA TAT ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT 94 
Glu Tyr Met Cym Lys Val lie Ser Lye Leu Gly Asn Asp Ser Ala Ser 

20 25 30 

GCC AAC ATC ACC ATT GTG GAG TCA AAC G 122 
Ala Aen lie Thr lie Val Glu Ser Aen Ala 
35 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

TCTAAAACTA CAGAGACTGT ATTTTCATGA TCATCATAGT TCTGTGAAAT ATACTTAAAC 60 

CGCTTTGGTC CTGATCTTGT AGG AAG TCA GAA CTT CCC ATT AGC AAA GCG 110 

Lye Ser Glu Leu Arg He Ser Lye Ala 
1 5 

TCA CTG CCT GAT TCT GGA GAA TAT ATG TGC AAA GTG ATC AGC AAA CTA 158 
Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val He Ser Lys Leu 

10 15 20 25 

GGA AAT GAC AGT GCC TCT GCC AAC ATC ACC ATT GTG GAG TCA AAC GOT 206 
Gly Asn Asp Ser Ala Ser Ala Asn He Thr He Val Glu Ser Asn Gly 
30 35 40 

AAG AGA TGC CTA CTG CCT OCT ATT TCT CAG TCT CTA AGA GGA GTG ATC 254 
Lys Arg Cys Leu Leu Arg Ala He Ser Gin ser Leu Arg Gly Val He 
« 50 55 

AAG GTA TGT GGT CAC ACT TGAATCACGC AGGTGTGTGA AATCTCATTG 302 
Lys Val Cye Gly His Thr 
60 

TGAACAAATA AAAATCATGA AAGGAAAACT CTATCTTTGA AATATCTTAT GGGTCCTCCT 362 
GTAAAGCTCT TCACTCCATA AGCTCAAATA CACCTGAAAT ATATATAGAT TATTT 417 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 137: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

AC ATC ACC ACT GCC ATC CCA CCC TCA ACT GAG ACA GCG TAT GTG TCT 47 
Glu II© Thr Thr Gly Mat Pro Ala Sor Thr Glu Thr Ala Tyr Val Sor 
1 5 10 15 

TCA GAG TCT CCC ATT AGA ATA TCA GTA TCA ACA GAA GGA ACA AAT ACT 95 
Ser Glu Sor Pro II* Arg lie Ser val Ser Thr Glu Gly Thr Asn Thr 
20 25 30 

TCT TCA T 102 
Ser Ser Ser 

35 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 138: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

AAG TGC GAA CCT GGA TTC ACT GGA GCG AGA TCT ACT GAG AAT GTG CCC 48 
Lye Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Aen Val Pro 

15 10 15 

ATC AAA GTC CAA ACC CAA GAA 69 
Met Lye Val Gin Thr Gin Glu 
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 139s 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 60 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC GTA ATG 
Lye eye Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met 

1 5 10 15 

GCC AGC TTC TAC 
Ala Ser Phe Tyr 

20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 140: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
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ACT ACG TCC ACT CCC TTT CTG TCT CTG CCT GAA TAG 36 

Ser Thr Ser Thr Pro Phe Leu Ser Leu Pr Glu 

1 5 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 141: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t a ingle 

(D) TOPOLOGY: Unaar 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

AAG CAT CTT GGG ATT GAA TTT ATG GAG 27 
Lya His Lau Gly lis Glu Phe Mat Glu 

1 5 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 142: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH x 569 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : B ingle 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

AAA GCG GAG GAG CTC TAC CAG AAG AGA GTG CTC ACC ATT ACC GCC ATT 48 
Lya Ala Glu Glu Lau Tyr Gin Lya Arg Val Lau Thr Ila Thr Gly I la 

1 5 10 15 

TGC ATC GCG CTG CTC GTG CTT GGC ATC ATG TCT GTG GTG CTC TAC TGC 96 
Cya Ila Ala Lau Lau Val Val Gly He Mat Cye Val Val Val Tyr Cya 
20 25 30 

AAA ACC AAG AAA CAA CGG AAA AAG CTT CAT GAC CGG CTT CGG CAG AGC 144 
Lya Thr Lya Lya Gin Arg Lya Lya Leu Hia Asp Arg Leu Arg Gin Sar 
35 40 45 

CTT CGG TCT GAA AGA AAC ACC ATG ATG AAC GTA GCC AAC GGG CCC CAC 192 
Leu Arg Ser Glu Arg Aan Thr Met Met Aan Val Ala Aan Gly Pro Hia 
SO 55 60 

CAC CCC AAT CCC CCC CCC GAG AAC GTG CAG CTG GTG AAT CAA TAC GTA 240 
Hia Pro Aan Pro Pro Pro Glu Aan Val Gin Lau Val Aan Gin Tyr Val 
65 70 75 80 

TCT AAA AAT GTC ATC TCT AGC GAG CAT ATT GTT GAG AGA CAG GCG GAG 288 
Ser Lya Aan Val He Ser Ser Glu Hia He Val Glu Arg Glu Ala Glu 
85 90 95 

AGC TCT TTT TCC ACC ACT CAC TAC ACT TCC AGA GCT CAT CAT TCC ACT 336 
Ser Ser Phe Ser Thr Ser Hia Tyr Thr Ser Thr Ala Hia Hia Ser Thr 

100 105 110 

ACT GTC ACT CAG ACT CCC AGT CAC AGC TGG AGC AAT GGA CAC ACT GAA 384 
Thr Val Thr Gin Thr Pro Ser Hia Ser Trp Ser Aan Gly Hia Thr Glu 
115 120 125 

AGC ATC ATT TCG GAA AGC CAC TCT GTC ATC GTG ATG TCA TCC GTA GAA 432 
Ser He He ser Glu Ser Hia Ser Val He Val Met Ser Ser Val Glu 

130 135 140 
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AAC AGT AGG CAC AGC AGC CCG ACT CCG GGC CCG AGA GGA CGT CTC AAT 480 
Asn Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg GXy Arg Lou Aan 
145 150 155 160 

GGC TTG GGA GGC CCT CGT GAA TGT AAC AGC TTC CTC AGG CAT GCC AGA 528 
Gly Leu Gly Gly Pro Arg Glu Cys Asn Ser Phe Lou Arg His Ala Arg 
165 170 175 

GAA ACC CCT GAC TCC TAC CGA GAC TCT CCT CAT AGT G AAAG 569 
Glu Thr Pro Asp Sor Tyr Arg Asp Sor Pro His sor 
180 185 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 143: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 

(B) TYPE: nucloic acid 

(C) STRANDEDNESS: singlo 
(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

G TAT GTA TCA GGA ATG ACC ACC CCG GCT CCT ATG TCA CCT CTA GAT 46 
Tyr Val Sor Ala Mot Thr Thr Pro Ala Arg Met Sor Pro Val Asp 
15 10 15 

TTC CAC ACG CCA AGC TCC CCC AAG TCA CCC CCT TCG GAA ATG TCC CCG 94 
Phe His Thr Pro Sor Sor Pro Lys Sor Pro Pro Sor Glu Mot Sor Pro 
20 25 30 

CCC GTG TCC AGC ACG AOG GTC TCC ATG CCC TCC ATG GCG GTC AGT CCC 142 
Pro Val Sor Sor Thr Thr Val Sor Mot Pro Sor Mot Ala Val Sor Pro 
35 40 45 

TTC GTG GAA GAG GAG AGA CCC CTC CTC CTT GTG ACG CCA CCA CCG CTG 190 
Pho Val Glu Glu Glu Arg Pro Lou Lou Lou Val Thr Pro Pro Arg Lou 
50 55 60 

CGG GAG AAG TAT CAC CAC CAC GCC CAC CAA TTC AAC TCG TTC CAC TCC 238 
Arg Glu Lys Tyr Asp His His Ala Gin Gin Pho Asn Sor Phe His Cys 

65 70 75 

AAC CCC GCG CAT GAG AGC AAC AGC CTG CCC CCC AGC CCC TTG AGC ATA 286 
Asn Pro Ala His Glu Sor Asn Sor Lou Pro Pro Sor Pro Leu Arg lie 
80 85 90 95 

GTG GAG GAT GAG GAA TAT GAA ACG ACC CAG GAG TAC GAA CCA GCT CAA 334 
Val Glu Asp Glu Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin 
100 105 110 

GAG CCG GTT AAG AAA CTC ACC AAC AGC AGC CCG CGG GCC AAA AGA ACC 382 
Glu Pro Val Lys Lys Leu Thr Aan Sor Sor Arg Arg Ala Lys Arg Thr 
115 120 125 

AAG CCC AAT GCT CAC ATT GCC CAC AGG TTG CAA ATG GAC AAC AAC ACA 430 
Lys Pro Asn Gly His He Ala His Arg Leu Glu Met Asp Aan Asn Thr 
130 135 140 

GGC GCT GAC AGC AGT AAC TCA GAG AGC CAA ACA GAG GAT GAA AGA GTA 478 
Gly Ala Asp Ser Sor Asn Ser Glu Ser Glu Thr Glu Asp Glu Arg Val 
145 150 155 

GGA GAA GAT ACC CCT TTC CTG GCC ATA CAG AAC CCC CTG CCA GCC AGT 526 
Glv Glu Asp Thr Pro Phe Leu Ala He Gin Asn Pro Leu Ala Ala Ser 
160 165 170 175 
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CTC GAG GCG GCC CCT GCC TTC CGC CTG GTC GAC AGC AGG ACT AAC CCA 574 
Leu Glu Ala Ala Pro Ala Phe Arg Leu Val Aap Ser Arg Thr Aan Pro 
180 185 190 

ACA GGC GGC TTC TCT CCG CAG GAA GAA TTG CAG GCC AGG CTC TCC GGT 622 
Thr Gly Gly Pha Sar Pro Gin Glu Glu Lau Gin Ala Arg Lau Sar Gly 
195 200 205 

GTA ATC GCT AAC CAA GAC CCT ATC GCT GTC TAAAACCGAA ATACACCCAT 672 

Val I la Ala Aan Gin Aap Pro lie Ala Val 

210 215 

AGATTCACCT GTAAAACTTT ATTTTATATA ATAAAGTATT CCACCTTAAA TTAAACAA 730 

(2) INFORMATION FOR SEQUENCE IDEKTIFI CATION NUMBER: 144: 
(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 1654 

(B) TYPE: nuclaic acid 

(C) STRAND ED NESS : alngla 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

AGTTTCCCCC CCCAACTTGT CGGAACTCTG GGCTCGCGCG CAGGGCAGGA GCGGAGCGGC 60 

GGCGCCTCCC CAGGCGATGC GAGCGCGGGC CGGACGGTAA TCGCCTCTCC CTCCTCGGGC 120 

TGCGAGCGCG CCGGACCGAG GCAGCGACAG GAGCGGACCG CGGCCGGAAC CGAGGACTCC 180 

CCAGCGGCGC CCCAGCAGGA GCCACCCCGC GAGCGTGCGA CCGGGACGGA GCGCCCGCCA 240 

GTCCCAGCTG GCCCGGACCG CACGTTGCGT CCCCGCGCTC CCCGCCGGCG ACAGGAGACG 300 

CTCCCCCCCA CGCCGCGCGC GCCTCGGCCC GGTCGCTGGC CCGCCTCCAC TCCGGCGACA 360 

AACTTTTCCC GAAGCCGATC CCAGCCCTCG GACCCAAACT TGTCGCGCGT CGCCTTCGCC 420 

GGGAGCCGTC CGCGCAGAGC GTGCACTTCT CGGGCG AG ATG TOG GAG CGC AGA 473 

Hat Ser Glu Arg Arg 

1 5 

GAA GGC AAA GGC AAG GGG AAG GGC GGC AAG AAG GAC CCA GGC TCC GGG 521 
Glu Gly Lya Gly Lya Gly Lya Gly Gly Lya Lya Aap Arg Gly Sar Gly 
10 15 20 

AAG AAG CCC GTG CCC GCG CCT GGC GGC CCG AGC CCA GCC TTG CCT CCC 569 
Lya Lya Pro Val Pro Ala Ala Gly Gly Pro Sar Pro Ala Lau Pro Pro 
25 30 35 

CGC TTG AAA GAG ATG AAG ATG CAG GAG TCT GTG CCA GGT TCC AAA CTA 617 
Arg Lau Lya Glu Mat Lya Sar Gin Glu sar Val Ala Gly Sar Lya Lau 
40 45 SO 

GTG CTT CGC TGC GAG ACC AGT TCT GAA TAC TCC TCT CTC AAG TTC AAG 665 
Val Leu Arg Cya Glu Thr Ser Sar Glu Tyr Sar Ser Leu Lya Phe Lya 
55 60 65 

TGG TTC AAG AAT GGG AGT GAA TTA AGC CCA AAG AAC AAA CCA CAA AAC 713 
Trp Phe Lya Aan Gly Ser Glu Leu Ser Arg Lya Aan Lya Pro Gin Aan 
70 75 80 85 

ATC AAG ATA CAG AAA AGG CCC GGG AAG TCA GAA CTT CGC ATT AGC AAA 761 
lie Lya lie Gin Lya Arg Pro Gly Lya sar Glu Lau Arg lie Ser Lya 
90 95 100 
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GCG TCA CTG GCT GAT TCT GGA GAA, TAT ATG TGC AAA GTG ATC AGC AAA 609 
Ala Ser Leu Ala Asp ser Gly Glu Tyr Met Cys Lye Val lie Ser Lys 

105 110 115 

CTA GGA AAT GAC AGT GCC TCT GCC AAC ATC ACC ATT GTG GAG TCA AAC 657 
Leu Gly Asn Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn 
120 125 130 

GAG ATC ACC ACT GGC ATG CCA GCC TCA ACT GAG ACA GCG TAT GTG TCT 905 
Glu He Thr Thr Gly Met Pro Ala Ser Thr Glu Thr Ala Tyr Val Ser 
135 140 145 

TCA GAG TCT CCC ATT AGA ATA TCA GTA TCA ACA GAA GGA ACA AAT ACT 953 
Ser Glu ser Pro He Arg lie Ser Val Ser Thr Glu Gly Thr Asn Thr 
150 155 160 165 

TCT TCA TCC ACA TCC ACA TCT ACA GCT GGG ACA AGC CAT CTT GTC AAG 1001 
Ser Ser Ser Thr Ser Thr Ser Thr Ala Gly Thr Ser His Leu Val Lys 

170 175 180 

TCT GGA GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGC GAG TGC TTC 1049 
Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe 
185 190 195 

ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC TTG TGC AAG TGC CCA 1097 
Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro 
200 205 210 

AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC 1145 
Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe 
215 220 225 



TAC AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT GAA TAGGCGCATG 1191 
Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
230 235 240 



CTCAGTCGGT 


GCCGCTTTCT 


TGTTGCCGCA 


TCTCCCCTCA 


GATTCAACCT 


AGAGCTAGAT 


1251 


GCG X III AC C 


AGGTCTAACA 


TTGACTGCCT 


CTGCCTGTCG 


CATGAGAACA 


TTAACACAAG 


1311 


CGATTGTATG 


ACTTCCTCTG 


TCCGTGACTA 


GTGGGCTCTG 


AGCTACTCCT 


AGGTGCGTAA 


1371 


GGCTCCAGTG 


TTTCTGAAAT 


TGATCTTGAA 


TTACTGTGAT 


ACGACATGAT 


AGTCCCTCTC 


1431 


ACCCAGTGCA 


ATGACAATAA 


AGCCCTTGAA 


AAGTCTCACT 


TTTATTGAGA 


AAATAAAAAT 


1491 


CGTTCCACGG 


GACAGTCCCT 


CTTCTTTATA 


AAATGACCCT 


ATCCTTGAAA 


AGGAGGTGTG 


1551 


TTAAGTTGTA 


ACCAGTACAC 


ACTTGAAATG 


ATGGTAAGTT 


CGCTTCGGTT 


CAGAATGTGT 


1611 


TCTTTCTGAC 


AAATAAACAG 


AATAAAAAAA 


AAAAAAAAAA 


A 




1652 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 145: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1140 

(8) TYPE: nucleic acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
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CAT CAN GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG 48 
His Gin Val Trp Ala Ala Lya Ala Gly Gly Lau Lya Lys Asp Ser Lau 

15 10 15 

CTC ACC GTG CGC CTG GGC GCC TCG GGC GAC CCC GCC TTC CCC TCC TGC 96 
Lau Thr Val Arg Lau Gly Ala Trp Gly Hia Pro Ala Phe Pro Ser Cya 
20 25 30 

GGG CGC CTC AAG GAG GAC AGC AGO TAC ATC TTC TTC ATG GAG CCC GAG 144 
Gly Arg Lau Lys Glu Asp Ser Arg Tyr He Phe Phe Met Glu Pro Glu 
35 40 45 

CCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC CCC 192 
Ala Aan Ser Ser Gly Gly Pro Gly Arg Leu Pro Ser Leu Leu Pro Pro 
50 55 60 

TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GOT GCT GTG 240 
Ser Arg Asp Gly Pro Glu Pro Gin Glu Gly Gly Gin Pro Gly Ala Val 
65 70 75 80 

CAA CGG TGC GCC TTG CCT CCC CGC TTG AAA GAG ATG AAG AGT CAG GAG 288 
Gin Arg Cya Ala Leu Pro Pro Arg Leu Lya Glu Met Lya Ser Gin Glu 
85 90 95 

TCT GTG GCA GCT TCC AAA CTA CTG CTT CGG TCC GAG ACC AGT TCT GAA 336 
Ser Val Ala Gly Ser Lya Leu Val Leu Arg Cya Glu Thr Ser Ser Glu 
100 105 110 

TAC TCC TCT CTC AAG TTC AAG TGG TTC AAG AAT GGG AGT CAA TTA ACC 384 
Tyr Ser Ser Leu Lya Phe Lys Trp Phe Lya Aan Gly Ser Glu Leu Ser 
115 120 125 

CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGG CCG GGG AAG 432 
Arg Lya Aan Lya Pro Glu Aan He Lya He Gin Lya Arg Pro Gly Lya 
130 135 140 

TCA GAA CTT CGC ATT AGC AAA GCC TCA CTG CCT GAT TCT GGA GAA TAT 480 
Ser Glu Leu Arg He Ser Lya Ala Ser Leu Ala Aap Ser Gly Glu Tyr 
145 150 155 160 

ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC 528 
Met Cya Lya Val He Ser Lya Leu Gly Aan Asp Ser Ala Ser Ala Aan 

165 170 175 

ATC ACC ATT GTG GAG TCA AAC GCC AGA TCC AGA TCT ACA GCT GGG AGA 576 
He Thr He Val Glu Ser Aan Ala Thr Ser Thr Ser Thr Ala Gly Thr 
180 185 190 

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 624 
Ser Hia Leu Val Lya Cya Ala Glu Lya Glu Lya Thr Phe Cya Val Aan 
195 200 205 

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 672 
Gly Gly Glu Cya Phe Met Val Lya Aap Leu Ser Aan Pro Ser Arg Tyr 
210 215 220 

TTG TCC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 720 
Leu Cya Lya Cys Gin Pro Gly Phe Thr Gly Ala Arg Cya Thr Glu Aan 
225 230 235 240 

GTG CCC ATG AAA GTC CAA ACC CAA GAA AAG TCC CCA AAT GAG TTT ACT 768 
Val Pro Met Lya Val Gin Thr Gin Glu Lys Cya Pro Aan Glu Phe Thr 
245 250 255 

GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC AGT ACG TCC 816 
ly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser 
260 265 270 
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ACT CCC TTT CTC TCT CTC CCT GAA TAGCGCATCT CACTCGCTCC CCCTT7CT7C 870 
Thr Pro Phe Leu ser Leu Pro Glu 
275 280 



TTCCCGCATC 


TCCCCTCAGA 


TTCCNCCTAG 


AGCTAGATGC 


CTTTTACCAG 


GTCTAACATT 


930 


GACTCCCTCT 


GCCTGTCGCA 


TGAGAACATT 


AACACAAGCG 


ATTGTATGAC 




990 


CGTGACTAGT 


GGGCTCTGAG 


CTACTCGTAG 


GTGCGTAAGG 


CTCCAGTGTT 


TCTGAAATTG 


10S0 


ATCTTGAATT 


ACTGTGATAC 


GACATGATAG 


TCCCTCTCAC 


CCAGTGCAAT 


GACAATAAAG 


1110 


GCCTTGAAAA 


GTCAAAAAAA AAAAAAAAAA 








1140 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 146s 
(i) SEQUENCE CHARACTERISTICS s 

(A) LENG TH; 1764 

(B) TTFEi nucleic acid 
<C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

G AAG TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA 49 
Ly» Ser Glu Leu Arg I la Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu 

15 10 15 

TAT ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC 97 
Tyr Met Cya Lys val Ila ser Lys Leu Gly Aan Aap Sar Ala Ser Ala 
20 25 30 

AAC ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG 145 
Aan Ila Thr Ila Val Glu Ser Aan Ala Thr Sar Thr Sar Thr Ala Gly 
35 40 45 

ACA AGC CAT CTT GTC AAG TCT CCA GAG AAG GAG AAA ACT TTC TGT GTG 193 
Thr Sar Hia Leu Val Lya Cya Ala Glu Lya Glu Lya Thr Phe Cya Val 
50 55 60 

AAT GGA CGC GAC TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA 241 
Aan Gly Gly Aap Cya Phe Met Val Lya Aap Leu Ser Aan Pro Ser Arg 
65 70 75 80 

TAC TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG 289 
Tyr Leu Cya Lya Cya Gin Pro Gly Phe Thr Gly Ala Arg Cya Thr Glu 
85 90 95 

AAT GTG CCC ATG AAA GTC CAA ACC CAA GAA AAA GCG GAG GAG CTC TAC 337 
Aan Val Pro Mat Lya Val Gin Thr Gin Glu Lya Ala Glu Glu Leu Tyr 
100 105 110 

GAG AAG AGA GTG CTC ACC ATT ACC GGC ATT TGC ATC GCG CTG CTC GTG 385 
Gin Lya Arg Val Leu Thr He Thr Gly He Cya He Ala Leu Leu Val 
115 120 125 

GTT GGC ATC ATG TGT GTG GTG GTC TAC TGC AAA ACC AAG AAA CAA CGC 433 
Val Gly He Met Cya Val Val Val Tyr Cya Lya Thr Lys Lys Gin Arg 
130 135 140 

AAA AAG CTT CAT GAC CGC CTT CGG CAG AGC CTT CGG TCT GAA AGA AAC 481 
Lys Lys Leu Hia Asp Arg Leu Arg Gin ser Leu Arg Ser Glu Arg Asn 
145 150 155 160 
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ACC ATC ATG AAC CTA GCC AAC CCC CCC CAC CAC CCC AAT CCC CCC CCC 529 
Thr Met Met Asn Val Ala Asn Gly Pro Hi* His Fro Aan Pro Pro Pro 

165 170 175 

GAG AAC GTG GAG CTG GTO AAT GAA TAC GTA TCT AAA AAT GTC ATC TCT 577 
Glu Asn Val Gin Leu Val Asn Gin Tyr Val Sar Lya Asn Val lie Ser 
180 185 190 

AGC GAG CAT ATT GTT GAG AGA GAG GCG GAG AGC TCT TTT TCC ACC AGT 625 
Sar Glu His I la Val Glu Arg Glu Ala Glu ser Sar Pha Sar Thr Sar 
195 200 205 

CAC TAC ACT TCG AGA OCT CAT CAT TCC ACT ACT GTC ACT CAG ACT CCC 673 
His Tyr Thr Sar Thr Ala His His Sar Thr Thr Val Thr Gin Thr Pro 
210 215 220 

AGT CAC AGC TCC AGC AAT GGA CAC ACT GAA AGC ATC ATT TCG GAA AGC 721 
Sar His Sar Trp Sar Asn Gly His Thr Glu Sar lie 21a Sar Glu Sar 

225 230 235 240 

CAC TCT GTC ATC GTG ATG TCA TCC GTA GAA AAC AGT AGG CAC AGC AGC 769 
His Sar Val lie Val Mat Sar Sar Val Glu Asn Ser Arg His Ser Sar 

245 250 255 

CCG ACT GGG GGC CCC AGA GGA CCT CTC AAT GGC TTG GGA GGC CCT CGT 817 
Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Leu Gly Gly Pro Arg 
260 265 270 

GAA TGT AAC AGC TTC CTC AGG CAT GCC AGA GAA ACC CCT GAC TCC TAC 865 
Glu Cys Asn Sar Pha Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr 
275 280 285 

CGA GAC TCT CCT CAT AGT GAA AGA CAT AAC CTT ATA GCT GAG CTA AGG 913 
Arg Asp Ser Pro His Ser Glu Arg His Asn Leu lie Ala Glu Leu Arg 
290 295 300 

AGA AAC AAC GCC CAC AGA TCC AAA TGC ATG CAG ATC CAG CTT TCC GGA 961 
Arg Asn Lys Ala His Arg Ser Lys Cys Met Gin lie Gin Leu Ser Ala 
305 310 315 320 

ACT CAT CTT AGA GCT TCT TCC ATT CCC CAT TGG GCT TCA TTC TCT AAG 1009 
Thr His Leu Arg Ala Ser Ser He Pro His Trp Ala Ser Phe Ser Lys 
325 330 335 

ACC CCT TGG CCT TTA GGA AGG TAT GTA TCA CCA ATG ACC ACC CCG GCT 1057 
Thr Pro Trp Pro Leu Gly Arg Tyr Val Ser Ala Met Thr Thr Pro Ala 
340 345 350 

CGT ATG TCA CCT GTA GAT TTC CAC ACC CCA AGC TCC CCC AAG TCA CCC 1105 
Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro 
355 360 365 

CCT TCG GAA ATG TCC CCG CCC GTG TCC AGC ACC ACG CTC TCC ATG CCC 1153 
Pro Ser Glu Met Ser Pro Pro Val Ser Ser Thr Thr Val Ser Met Pro 
370 375 380 

TCC ATG GCG GTC AGT CCC TTC CTG GAA GAG GAG AGA CCC CTC CTC CTT 1201 
Ser Met Ala Val Ser Pro Phe Val Glu Glu Glu Arg Pro Leu Leu Leu 
385 390 395 400 

CTG ACG CCA CCA CCG CTG CCC GAC AAC TAT CAC CAC CAC GCC CAC CAA 1249 
Val Thr Pro Pro Arg Leu Arg Glu Lys Tyr Asp His His Ala Gin Gin 
405 410 415 

TTC AAC TCG TTC CAC TGC AAC CCC GCG CAT CAC ACC AAC ACC CTC CCC 1297 
Phe Asn Ser Phe His Cys Asn Pro Ala His Glu Ser Asn Ser Leu Pro 
420 425 430 
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CCC AGC CCC TTG AGG ATA GTG GAG GAT GAG GAA TAT GAA ACG ACC GAG 1345 

Pro Ser Pro Leu Arg lie Val Glu Asp Glu Glu Tyr Glu Thr Thr Gin 
435 440 445 

GAG TAC GAA CCA GCT CAA GAG CCG GTT AAG AAA CTC ACC AAC AGC AGC 1393 
Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys Lye Leu Thr Aan Ser Ser 
450 455 460 

CGG CGG GCC AAA AGA ACC AAG CCC AAT GGT CAC ATT GCC CAC AGG TTG 1441 
Arg Arg Ala Lys Arg Thr Lys Pro Aan Gly His lie Ala His Arg Leu 
465 470 475 480 

GAA ATG GAC AAC AAC ACA GGC GCT GAC AGC AGT AAC TCA GAG AGC GAA 1489 
Glu Met Asp Asn Asn Thr Gly Ala Asp Ser Ser Aan Ser Glu Ser Glu 
485 490 495 

ACA GAG GAT GAA AGA GTA GGA GAA GAT ACG CCT TTC CTG GCC ATA CAG 1537 
Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Ala He Gin 
S00 505 510 

AAC CCC CTC GGA GCC AGT CTC GAG GCC GCC CCT GCC TTC CGC CTG GTC 1585 
Asn Pro Leu Ala Ala Ser Leu Glu Ala Ala Pro Ala Phe Arg Leu val 
515 520 525 

GAC AGC AGG ACT AAC CCA ACA GGC GGC TTC TCT CCG CAG GAA GAA TTG 1633 
Asp Ser Arg Thr Asn Pro Thr Gly Gly Phe Ser Pro Gin Glu Glu Leu 
530 535 540 

CAG GCC AGG CTC TCC GGT GTA ATC GCT AAC CAA GAC CCT ATC GCT GTC 1681 
Gin Ala Arg Leu Ser Gly Val He Ala Asn Gin Asp Pro He Ala Val 
545 550 555 560 

TAAAACCGAA ATACACCCAT AGATTCACCT GTAAAACTTT ATTTTATATA ATAAAGTATT 1741 

CCACCTTAAA TTAAACAAAA AAA 1764 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 147: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 50 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

Lye Cys Ala Glu Lys Glu Lye Thr Phe Cys Val Asn Gly Gly Glu Cys 

1 5 10 15 

Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys 

20 25 30 

Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Aon Tyr Val Met Ala Ser 
35 40 45 

Phe Tyr 

50 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 148: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 

(B) TYPE: amino acid 
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<C) STRANDED NESS I 

<D) TOPOLOGY x linear 

(Xi) SEQUENCE DESCRIPTI N: SEQ ID NO: 148: 

Lye Cya Ala Clu Lye Glu Lya Thr Pha Cya Val Aan Gly Cly Glu Cya 

1 5 io is 

Pha Mat Val Lya Aap Lau Ser Aan Pro Ser Arg Tyr Leu Cya Lya Cva 

20 25 30 

Gin Pro Gly Phe Thr Gly Ala Arg Cya Thr Glu Aan Val Pro Mat Lya 
35 40 45 

Val Gin 

50 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 149: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 

(B) TYPE: amino acid 
<C) STRANDED NESS : 

(D) TOPOLOGY: linaar 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

Glu Cya Lau Arg Lya Tyr Lya Aap Pha Cya Ila Hia Cly Glu Cya Lya 

1 5 10 15 

Tyr Val Lya Glu Lau Arg Ala Pro Sar Cya Lya Cya Gin Gin Glu Tyr 
20 25 30 

Pha Cly Glu Arg Cya Gly Glu Lya Sar Aan Lya Thr Hia Sar 
35 40 45 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 150: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 

(B) TYPEs nuclaic acid 

(C) STRANDED NESS : aingla 

(D) TOPOLOGY: linaar 

(xi) SEQUENCE DESCRIPTION t SEQ ID NO: 150: 

AGC CAT CTT GTC AAG TGT CCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48 
Sar Hia Lau Val Lya Cya Ala Glu Lya Glu Lya Thr Pha Cya Val Aan 

15 10 15 

GGA GGC GAG TCC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96 
Gly Gly Glu Cya Pha Mat Val Lya Aap Lau Sar Aan Pro Sar Arg Tyr 
20 25 30 

TTG TGC AAG TCC CCA AAT GAG TTT ACT GGT GAT CGC TCC CAA AAC TAC 144 
Lau Cya Lya Cya Pro Aan Glu Pha Thr Gly Aap Arg Cya Gin Aan Tyr 
35 40 45 

GTA ATG GCC AGC TTC TAC AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT 192 
Val Mat Ala Ser Pha Tyr Sar Thr Ser Thr Pro Phe Lau Ser Lau Pro 
50 55 60 
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GAA TAG 198 

Glu 

65 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 151: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS i eingle 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

AGC CAT CTT GTC AAG TGT CCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48 
Ser His Leu Val Lye Cye Ala Glu Lye Glu Lye Thr Phe Cye Val Aen 
15 10 15 

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96 
Gly Gly Glu Cye Phe Met Val Lye Aep Leu Ser Aen Pro Ser Arg Tyr 
20 25 30 

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 144 
Leu Cye Lye Cye Gin Pro Gly Phe Thr Gly Ala Arg Cye Thr Glu Aen 
35 40 45 

GTG CCC ATG AAA GTC CAA ACC CAA GAA AAA GCG GAG GAG CTC TAC TAA 192 
Val Pro Met Lye Val Gin Thr Gin Glu Lye Ala Glu Glu Leu Tyr 
50 55 60 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 152: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 183 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: ■ ingle 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48 
Sex Hie Leu Val Lye Cye Ala Glu Lye Glu Lye Thr Phe Cye Val Aen 
15 10 15 

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96 
Gly Gly Glu Cye Phe Met Val Lye Aep Leu Ser Aen Pro Ser Arg Tyr 
20 25 30 

TTC TGC AAG TGC CCA AAT GAG TTT ACT GOT CAT CGC TGC CAA AAC TAC 144 
Cye Lye Cye Pro Aen Glu Phe Thr Gly Aep Arg Cye Gin Aen Tyr 
35 40 45 

CTA ATG GCC AGC TTC TAC AAA GCG GAG GAG CTC TAC TAA 183 
Val Met Ala Ser Phe Tyr Lye Ala Glu Glu Leu Tyr 
50 55 60 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 153: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 

(B) TYPE: nucleic acid 
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<C) STRANDEDNESS: single 
(D) TOPOLOGY* linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

AGC CAT CTT CTC AAG TCT CCA GAG AAG GAG AAA ACT TTC TOT GTG AAT 48 
Ser Hia Leu Val Lye Cya Ala Glu Lya Glu Lya Thr Phe Cya Val Aan 

1 5 10 15 

GGA GGC GAG TGC TTC ATC GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96 
Giy Cly Glu Cya Phe Met Val Lye Aap Leu Ser Aan Pro Ser Arg Tyr 
20 25 30 

TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC 144 
Leu eye Lya Cya Pro Aan Glu Phe Thr Gly Aap Arg Cya Gin Aan Tyr 
35 40 45 

GTA ATG CCC AGC TTC TAC AAG CAT CTT GGG ATT GAA TTT ATC GAG AAA 192 
Val Met Ala Ser Phe Tyr Lya Hia Leu Gly He Glu Phe Met Glu Lya 
50 55 60 

GCG GAG GAG CTC TAC TAA 210 
Ala Glu Glu Leu Tyr 

65 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 154: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 267 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: X54: 

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TOT GTG AAT 48 
Ser Hia Leu Val Lya Cya Ala Glu Lya Glu Lya Thr Phe Cya Val Aan 

1 5 10 15 

GGA GGC GAG TGC TTC ATG GTC AAA GAC CTT TCA AAT CCC TCA AGA TAC 96 
Gly Gly Glu Cya Phe Met Val Lya Aap Leu Ser Aan Pro Ser Arg Tyr 
20 25 30 

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 144 
Leu Cya Lya Cya Gin Pro Gly Phe Thr Gly Ala Arg Cya Thr Glu Aan 
35 40 45 

GTG CCC ATG AAA GTC CAA ACC CAA GAA AAG TGC CCA AAT GAG TTT ACT 192 
Val Pro Met Lya Val Gin Thr Gin Glu Lya Cya Pro Aan Glu Phe Thr 
50 55 60 

GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC AGT ACG TCC 240 
Gly Aap Arg Cya Gin Aan Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser 

65 70 75 80 

ACT CCC TTT CTG TCT CTG CCT GAA TAG 267 
Thr Pro Phe Leu Ser Leu Pro Glu 
85 

(2> INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 155: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 252 
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(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

ACC CAT CTT GTC AAG TOT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48 
Ser Hie Leu Val Lye Cya Ala Glu Lye Glu Lys Thr Phe Cya Val A«n 
15 10 15 

GGA GCC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96 
Gly Gly Glu Cya Pho Met Val Lya Asp Leu Ser Aan Pro Ser Arg Tyr 
20 25 30 

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 144 
Leu Cya Lya Cya Gin Pro Gly Phe Thr Gly Ala Arg Cya Thr Glu Aan 
35 40 45 

GTG CCC ATG AAA GTC CAA ACC CAA GAA AAG TGC CCA AAT GAG TTT ACT 192 
Val Pro Met Lya Val Gin Thr Gin Glu Lya Cya Pro Aan Glu Phe Thr 
50 55 60 

GGT GAT CGC TGC CAA AAC TAC CTA ATG GCC AGC TTC TAC AAA GCG GAG 240 
Gly Asp Arg Cya Gin Aan Tyr Val Met Ala Ser Phe Tyr Lya Ala Glu 
65 70 75 80 

GAG CTC TAC TAA 252 
Glu Leu Tyr 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 156: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : aingle 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

CC ACA TCC ACA TCT ACA GCT GGG ACA AGC CAT CTT GTC AAC TGT GCA 47 
Thr Ser Thr Ser Thr Ala Gly Thr Ser His Leu Val Lya Cya Ala 

15 10 15 

GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGC GAG TGC TTC ATG GTC 95 
Glu Lya Glu Lya Thr Phe Cya Val Aan Gly Gly Glu Cya Phe Met Val 
20 25 30 

AAA GAC CTT TCA AAT CCC TCA ACA TAC TTG T GC ^ 
Lya Aap Leu Ser Aan Pro Ser Arg Tyr Leu 

35 40 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 157: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 
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A CAT AAC CTT ATA OCT GAG CTA AGG AGA AAC AAG GCC CAC AGA TCC 46 
His Asn Lsu lis Ala Glu Leu Arg Arg Asn Lys Ala His Arg Ssr 

1 5 10 15 

AAA TGC ATG GAG ATC CAG CTT TCC GCA ACT CAT CTT AGA GCT TCT TCC 94 
Lys Cys Mst Gin lis Gin Lsu Ser Ala Thr His Lsu Arg Ala Ser Ssr 

20 25 30 

ATT CCC CAT TGG GCT TCA TTC TCT AAG ACC CCT TGG CCT TTA GGA AG 141 
lis Pro His Trp Ala Ssr Phs Ssr Lys Thr Pro Trp Pro Lsu Gly Arg 
3S 40 45 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 158: 
(i) SEQUENCE CHARACTERISTICS s 

(A) LENGTH: 24 

(B) TYPE: amino acid 

(C) STRAND ED NESS : 

(D) TOPOLOGY s linsar 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in positions 15 and 22 

is unknown. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

Ala Ala Glu Lys Glu Lys Thr Phs Cys Val Asn Gly Gly Glu Xaa Phs 

15 10 15 

Mat Val Lys Asp Lsu Xaa Asn Pro 

20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 

<B) TYPE: nuclsic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linsar 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

ATG AGA TGG CGA CGC GCC CCG CCC CGC TCC GGG CCT CCC GGC CCC CGG 48 
Mst Arg Trp Arg Arg Ala Pro Arg Arg Ser Gly Arg Pro Gly Pro Arg 
1 5 10 15 

GCC CAG CGC CCC GGC TCC GCC GCC CGC TCG TCC CCG CCG CTG CCG CTG 96 
Ala Gin Arg Pro Gly Ssr Ala Ala Arg Ssr Ssr Pro Pro Lsu Pro Lsu 
20 25 30 

CTG CCA CTA CTG CTG CTG CTG GGG ACC GCG GCC CTG GCG CCG GGG GCG 144 
Lsu Pro Lsu Lsu Lsu Lsu Lsu Gly Thr Ala Ala Lsu Ala Pro Gly Ala 
35 40 45 

GCG CCC GGC AAC GAG GCG GCT CCC GCG GGG GCC TCG CTG TGC TAC TCG 192 
Ala Ala Gly Asn Glu Ala Ala Pro Ala Gly Ala Ser Val Cys Tyr ssr 
5P 55 60 

TCC CCG CCC ACC GTG GGA TCG GTG CAG GAG CTA GCT CAG CGC GCC GCG 240 
Ssr Pro Pro Ssr Val Gly Ssr Val Gin Glu Lsu Ala Gin Arg Ala Ala 
65 70 75 80 
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GTG GTG ATC GAG GGA AAG GTG CAC CCG CAG CGG CGG CAG GAG GGG GCA 288 
Val Val lie Glu Gly Lya Val His Pro Gin Arg Arg Gin Gin Gly Ala 
85 90 95 

CTC GAC AGG AAG GCG GCG GCG GCG GCG GGC GAG GGA GGG GCG TGG GGC 336 
Lau Asp Arg Lys Ala Ala Ala Ala Ala Gly Glu Ala Gly Ala Trp Gly 
100 105 110 

GGC GAT CGC GAG CCG CCA GCC GCG GGC CCA CGG CCG CTG GGG CCG CCC 384 
Gly Asp Arg Glu Pro Pro Ala Ala Gly Pro Arg Ala Lau Gly Pro Pro 
115 120 125 

GCC GAG GAG CCG CTC CTC GCC GCC AAC GGG ACC GTG CCC TCT TGG CCC 432 
Ala Glu Glu Pro Lau Lau Ala Ala Asn Gly Thr Val Pro Ssr Trp Pro 
130 135 140 

ACC GCC CCG GTG CCC ACC GCC GGC GAG CCC GGG GAG GAG GCG CCC TAT 480 
Thr Ala Pro Val Pro Sar Ala Gly Glu Pro Gly Glu Glu Ala Pro Tyr 
145 150 155 160 

CTG GTG AAG GTG CAC CAG GTG TGG GCG GTG AAA GCC GGG CGC TTG AAG 528 
Lau Val Lys Val His Gin Val Trp Ala Val Lys Ala Gly Gly Leu Lys 

165 170 175 

AAG GAC TCG CTG CTC ACC GTG CGC CTG GGG ACC TGG GGC CAC CCC GCC 576 
Lys Asp Ssr Lau Lau Thr Val Arg Lau Gly Thr Trp Gly His Pro Ala 
180 185 190 

TTC CCC TCC TGC GGG AGG CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC 624 
Pha Pro Sar Cys Gly Arg Lau Lys Glu Asp Sar Arg Tyr I la Pha Pha 
195 200 205 

ATG GAG CCC GAC GCC AAC AGC ACC AGC CGC GCG CCG GCC GCC TTC CGA 672 
Mat Glu Pro Asp Ala Aan Sar Thr Sar Arg Ala Pro Ala Ala Phe Arg 
210 215 220 

GCC TCT TTC CCC CCT CTG GAG ACG GGC CGG AAC CTC AAG AAG GAG GTC 720 
Ala Sar Pha Pro Pro Leu Glu Thr Gly Arg Asn Lau Lys Lys Glu Val 
225 230 235 240 

AGC CGG GTC CTG TCC AAG CGG TGC G 745 
Sar Arg Val Lau Cys Lys Arg Cys 
245 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 160: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEONESS: 

(0) TOPOLOGY: linaar 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is 

unknown. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

Xaa Ala Lau Ala Ala Ala Gly Tyr Asp Val Glu Lys 

15 10 

(2) INFORMATION F R SEQUENCE IDENTIFICATION NUMBER: 161: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is 

unknown. 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

Xaa Leu Val Leu Arg 

1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 162: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in positions 1, 2, and 

3 is unknown. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Xaa Xaa Xaa Tyr Pro Gly Gin I la Thr Ser Asn 

15 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 163: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: N in positions 25 and 36 is 

unknown* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
ATAGGGAAGC GCGGGGGAAG GGTCNCCCTC NGCAGGGCCG GGCTTGCCTC TGGAGCCTCT 60 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 164: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS s single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 
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(D) OTHER INFORMATION : N in position 16 is 

unknown . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
TTTACACATA TATTCNCC 18 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 165: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

Glu Thr Gin Pro Asp Pro Gly Gin He Leu Lys Lys val Pro Met Vai 
15 10 15 

He Gly Ala Tyr Thr 
20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 166: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Met Arg Trp Arg Arg Ala Pro Arg Arg Ser Gly Arg Pro Gly Pro Arg 
15 10 15 

Ala Gin Arg Pro Gly Ser Ala Ala Arg Ser Ser Pro Pro Leu Pro Leu 
20 25 30 

Leu Pro Leu Leu Leu Leu Leu Gly Thr Ala Ala Leu Ala Pro Gly Ala 
35 40 45 

Ala Ala Gly Asn Glu Ala Ala Pro Ala Gly Ala Ser Val Cys Tyr ser 
50 55 60 

Ser Pro Pro Ser Val Gly Ser Val Gin Glu Leu Ala Gin Arg Ala Ala 

65 70 75 ' 80 

Val Val He Glu Gly Lys Val His Pro Gin Arg Arg Gin Gin Gly Ala 
85 90 95 

Leu Asp Arg Lys Ala Ala Ala Ala Ala Gly Glu Ala Gly Ala Trp Gly 

100 105 HO 

Gly Asp Arg Glu Pro Pro Ala Ala Gly Pro Arg Ala Leu Gly Pro Pro 
115 120 125 

Ala Glu Glu Pro Leu Leu Ala Ala Asn Gly Thr Val Pro Ser Trp Pro 
130 135 140 

Thr Ala Pro Val Pro Ser Ala Gly Glu Pro Gly Glu Glu Ala Pro Tyr 
145 150 155 160 
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Leu Val Lys Val His Gin Val Trp Ala Val Lye Ala Gly Gly Leu Lys 

165 170 175 

Lys Asp Ser Leu Leu Thr Val Arg Leu Gly Thr Trp Gly His Pro Ala 

180 185 190 

Pha Pro Ser Cya Gly Arg Lau Lya Glu Aap Sor Arg Tyr Ila Pha Pha 

195 200 205 

Nat Glu Pro Aap Ala Aan Ser Thr Sar Arg Ala Pro Ala Ala Pha Arg 

210 215 220 

Ala Sar Pha Pro Pro Lau Glu Thr Gly Arg Aan Lau Lya Lys Glu Val 

225 230 235 240 

Sar Arg Val Lau Cya Lya Arg Cya Ala Lau Pro Pro Gin Lau Lys Glu 

245 250 255 

Mat Lys Sar Gin Glu Sar Ala Ala Gly Sar Lya Lou Val Lau Arg Cya 

260 265 270 

Glu Thr Sar Sar Glu Tyr Sar Sar Lau Arg Pha Lys Trp Pha Lys Asn 
275 280 285 

Gly Asn Glu Lau Asn Arg Lys Asn Lys Pro Gin Asn lie Lys Ila Gin 
290 295 300 

Lys Lys Pro Gly Lys Sar Glu Lau Arg He Asn Lys Ala Sar Leu Ala 
305 310 315 320 

Asp Sar Gly Glu Tyr Mat Cys Lys val lie Sar Lya Leu Gly Asn Asp 

325 330 335 

Sar Ala Ser Ala Asn He Thr He Val Glu Ser Asn Ala Thr Sar Thr 
340 345 350 

Ser Thr Thr Gly Thr Ser His Leu val Lys Cya Ala Glu Lys Glu Lys 
355 360 365 

Thr Pha Cya Val Aan Gly Gly Glu Cya Pha Met Val Lys Asp Leu Ser 

370 375 380 

Aan Pro Ser Arg Tyr Leu Cya Lya Cys Pro Aan Glu Pha Thr Gly Aap 
385 390 395 400 

Arg Cya Gin Aan Tyr Val Mat Ala Sar Pha Tyr Sar Thr Sar Thr Pro 
405 410 415 

Pha Leu Ser Leu Pro Glu 
420 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 167: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

Mat Ser Glu Arg Lya Glu Gly Arg Gly Lya Gly Lya Gly Lya Lya Lya 

15 10 15 
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Glu Arg Gly Ser Gly Lye Lye Pr Glu Ser Ala Ala Gly Ser Gin Ser 

20 25 30 

Pro Arg Glu lift He Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr 
35 40 45 

Val Ser Ser Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala 
50 55 60 

Aen Thr Ser Ser Ser 

65 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 168: 
(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 19 

(B) TYPE: amino acid 

(C) STRANDSDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

Arg Lys Gly Aep Val Pro Gly Pro Arg Val Lya Ser Ser Arg Ser Thr 

15 10 15 

Thr Thr Ala 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 169: 

(1) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 231 

(B) TYPE: nucleic acid 

<C) STRANDSDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

CGCGAGCGCC TCAGCGCGGC CGCTCGCTCT CCCCCTCCAC GGACAAACTT TTCCCAAACC 60 

CGATCCGAGC CCTTGGACCA AACTCGCCTG CGCCGAGAGC CGTCCGCGTA GAGCGCTCCG 120 

TCTCCGGCGA GATGTCCGAG CGCAAAGAAG GCAGAGGCAA AGGGAAGGGC AAGAAGAAGG 180 

AGCGAGGCTC CGGCAAGAAG CCGGAGTCCG CGGCCGGCAG CCAGAGCCCA G 231 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 
<B) TYPE: nucleic acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

CCTTGCCTCC CCGATTGAAA GAGATGAAAA GCCAGGAATC GGCTGCAGGT TCCAAACTAG 60 

TCCTTCGGTG TGAAACCAGT TCTGAATACT CCTCTCTCAG ATTCAAGTGG TTCAAGAATG 120 

GGAATGAATT GAATCGAAAA AACAAACCAC AAAATATCAA GATACAAAAA AAGCCAGG 178 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 171: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

GAAGTCAGAA CTTCGCATTA ACAAAGCATC ACTGGCTGAT TCTGGAGAGT ATATGTCGAA 60 

AGTGATCAGC AAATTAGGAA ATGACAGTGC CTCTGCCAAT ATCACCATCG TGGAATCAAA 120 

CG 122 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 172: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 
AGATCATCAC TGGTATGCCA GCCTCAACTG AAGGAGCATA TGTGTCTTCA GAGTCTCCCA 60 
TTAGAATATC AGTATCCACA GAAGGAGCAA ATA C TT C TTC AT 102 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 173: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENG TH: 128 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

CTACATCTAC ATCCACCACT GGGACAAGCC ATCTTGTAAA ATGTGCGGAG AACGAGAAAA 60 

CTTTCTGTGT GAATGGAGGG GAGTGCTTCA TGGTGAAAGA CCTTTCAAAC CCCTCCAGAT 120 

ACTTGTGC 128 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 174: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
AAGTGCCAAC CTCCATTCAC TGGAGCAAGA TCTACTCAGA ATGTCCCCAT GAAAGTCCAA 60 
AACCAAGAA 69 



SUBSTITUTE SHEET (RULE 26) 



W0 94/26298 PCT/US94/05083 

- 148 - 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 175s 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 
AAGTGCCCAA ATCACTTTAC TGGTGATCGC TGCCAAAACT ACGTAATGGC CAGCTTCTAC 60 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 176: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 
AGTACGTCCA CTCCCTTTCT GTCTCTGCCT GAATAG 36 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 177: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 177: 



AAGGCGGAGG 


AGCTGTACCA 


GAAGAGAGTG 


CTGACCATAA 


CCGGCATCTG 


CATCGCCCTC 


60 


CTTGTGGTCG 


GCATCATGTG 


TGTGGTGGCC 


TACTGCAAAA 


CCAAGAAACA 


GCGGAAAAAG 


120 


CTGCATGACC 


GTCTTCGGCA GAGCCTTCGG 


TCTGAACGAA 


ACAATATCAT 


GAACATTGCC 


180 


AATGGGCCTC 


ACCATCCTAA 


CCCACCCCCC 


GAGAATGTCC 


AGCTGGTGAA 


TCAATACGTA 


240 


TCTAAAAACG 


TCATCTCCAG 


TGAGCATATT 


GTTGAGAGAG 


AAGCAGAGAC 


ATCCTTTTCC 


300 


ACCAGTCACT 


ATACTTCCAC 


AGCCCATCAC 


TCCACTACTG 


TCACCCAGAC 


TCCTAGCCAC 


360 


AGCTGGAGCA 


ACGGACACAC 


TGAAAGCATC 


CTTTCCGAAA 


GCCACTCTGT 


AATCGTGATG 


420 


TCATCCGTAG 


AAAACAGTAG 


GCACAGCAGC 


CCAACTGGGG 


GCCCAAGAGG 


ACGTCTTAAT 


430 


GGCACAGGAG 


GCCCTCGTGA 


ATGTAACACC 


TTCCTCAGGC ATGCCAGAGA 


AACCCCTCAT 


540 


TCCTACCGAG 


ACTCTCCTCA 


TAGTGAAAG 








569 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 178: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 730 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
<xi) SEQUENCE DESCRIPTION : SEQ ID NOt 178: 

GTATCTGTCA GCCATGACCA CCCCGGCTCG TATGTCACCT GTAGATTTCC ACACGCCAAG 60 

CTCCCCCAAA TCGCCCCCTT CGGAAATGTC TCCACCCGTG TCCAGCATGA CGGTGTCCAT 120 

GCCTTCCATG GCGGTCAGCC CCTTCATGGA AGAAGAGAGA CCTCTACTTC TCGTGACACC 180 

ACCAAGGCTG CGGGAGAAGA AGTTTGACCA TCACCCTCAG CAGTTCACCT CCTTCCACCA 240 

CAACCCCGCG CATGACAGTA ACAGCCTCCC TGCTAGCCCC TTGAGGATAG TGGAGGATGA 300 

GGAGTATGAA ACGACCCAAG AGTACGAGCC ACCCCAAGAG CCTGTTAAGA AACTCCCCAA 360 

TAGCCGGCGG GCCAAAAGAA CCAAGCCCAA TGGCCACATT GCTAACAGAT TGGAAGTGGA 420 

CAGCAACACA AGCTCCCAGA GCAGTAACTC AGAGAGTGAA ACAGAAGATG AAAGAGTAGG 480 

TGAAGATACG CCTTTCCTGG GCATACAGAA CCCCCTGGCA GCCAGTCTTG AGGCAACACC 540 

TGCCTTCCGC CTGGCTGACA GCAGGACTAA CCCAGCACGC CGCTTCTCGA CACAGGAAGA 600 

AATCCAGGCC AGGCTGTCTA GTGTAATTGC TAACCAAGAC CCTATTGCTG TATAAAACCT 660 

AAATAAACAC ATAGATTCAC CTGTAAAACT TTATTTTATA TAATAAAGTA TTCCACCTTA 720 

AATTAAACAA 730 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 179: 
<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 23 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 
<D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

TCGGGCTCCA TGAAGAAGAT GTA 23 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 180: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 23 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 
TCCATGAAGA AGATGTACCT GCT 23 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 181: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 
ATGTACCTGC TCTCCTCCTT GA 22 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 182: 
(i) SEQUENCE CHARACTERISTICS x 

(A) LENGTH: 22 

(B) TYFSs nucleic acid 

(C) STRANDED NESS : a ingle 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 
TTGAAGAAGG ACTCGCTGCT GA 22 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 183: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 
AAAGCCGGGG GCTTGAAGAA 20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 184: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 
ATGARGTGTG GGCGGCGAAA 20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 185: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

Met Arg Trp Arg Arg Ala Pro Arg Arg Ser Gly Arg Pro Gly Pro Arg 
15 10 15 

Ala Gin Arg Pro Gly Ser Ala Ala Arg Ser Ser Pro Pro Leu Pro Leu 
20 25 30 

Leu Pro Leu Leu Leu Leu Leu Gly Thr Ala Ala Leu Ala Pr Gly Ala 
35 40 45 
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Ala Ala Gly Asn Glu Ala Ala Pr Ala Gly Ala Ser Val Cys Tyr Ser 

SO 55 60 

Sar Fro Pro sar Val Gly Ser Val Gin Glu Leu Ala Gin Arg Ala Ala 
65 70 75 80 

Val Val lie Glu Gly Lys Val His Pro Gin Arg Arg Gin Gin Gly Ala 
85 90 95 

Lau Aap Arg Lys Ala Ala Ala Ala Ala Gly Glu Ala Gly Ala Trp Gly 

100 105 110 

Gly Asp Arg Glu Pro Pro Ala Ala Gly Pro Arg Ala Leu Gly Pro Pro 
115 120 125 

Ala Glu Glu Pro Leu Leu Ala Ala Aan Gly Thr Val Pro Sar Trp Pro 
130 13S 140 

Thr Ala Pro Val Pro Ser Ala Gly Glu Pro Gly Glu Glu Ala Pro Tyr 
145 150 155 160 

Leu Val Lys Val His Gin Val Trp Ala Val Lya Ala Gly Gly Leu Lys 

165 170 175 

Lys Asp Ser Leu Leu Thr Val Arg Leu Gly Thr Trp Gly His Pro Ala 
180 185 190 

Phe Pro Ser Cys Gly Arg Leu Lys Glu Asp Ser Arg Tyr lie Phe Phe 
195 200 2Q5 

Met Glu Pro Asp Ala Asn Ser Thr Ser Arg Ala Pro Ala Ala Phe Arg 
210 215 220 

Ala Ser Phe Pro Pro Leu Glu Thr Gly Arg Asn Leu Lys Lys Glu Val 
225 230 235 240 

Ser Arg Val Leu Cys Lys Arg Cys Ala Leu Pro Pro Gin Leu Lys Glu 
245 250 255 

Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys 
260 265 270 

Glu Thr Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lya Trp Phe Lys Asn 
275 280 285 

Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Aan He Lys He Gin 
290 295 300 

Lys Lys Pro Gly Lys Ser Glu Leu Arg He Asn Lys Ala Ser Leu Ala 
305 310 315 320 

Asp Ser Gly Glu Tyr Met Cys Lys Val He Ser Lys Leu Gly Asn Asp 

325 330 335 

ser Ala Ser Ala Asn He Thr He Val Glu Ser Asn Ala Thr ser Thr 
340 345 350 

Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys 
355 360 365 

Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met val Lys Asp Leu Ser 
370 375 380 

Asn Pro Ser Arg Tyr Leu Cya Lys Cys Pro Asn Glu Phe Thr Gly Asp 
385 390 395 400 
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Arg Cy» Gin A«n Tyr Val Mat Ala Ser Ph« Tyr Ser Thr Sar Thr Pr 

405 410 415 

Phe Lau Sar Lau Pro Glu 

420 



What is claimed is: 
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1. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide encoded by pGGF2HBS5 
deposited with the A.T.C.C. November 6, 1992 (A.T.C.C. 

5 Deposit No. 75347) with a pharmaceutical carrier. 

2. A method of making a medicament for the 
treating of muscle cells of a mammal, said method 
comprising admixing a polypeptide encoded by the E 
sequence (SEQ ID Nos. 133 and 159) and at least a portion 

10 of the peptide encoded by the DNA sequences flanking the 
E encoding sequence on clone pGGF2HBS5, deposited with 
the A.T.C.C. November 6, 1992 (A.T.C.C. Deposit No. 
75347) . 

3. A method of making a medicament for the 
15 treating of muscle cells of a mammal, said method 

comprising admixing a polypeptide defined by the formula 

YBAZCX 

wherein YBAZCX is composed of the polypeptide 
segments shown in Fig. 30 (SEQ ID Nos. 133-135, 156, 

20 159) ; wherein Y comprises polypeptide segment E, or is 
absent; wherein Z comprises polypeptide segment G or is 
absent; and wherein X comprises polypeptide segments C/D 
HKL, C/D. H, C/D HL, C/D D, C/D' HL, C/D' HKL, C/D' H, 
C/D' D, C/D C/D' HKL, C/D C/D' H, C/D C/D' HL, C/D C/D' 

25 D, C/D D' H, C/D D' HL, C/D D' HKL, C/D' D' H, C/D' D' 
HL, C/D' D' HKL, C/D C/D' D' H, C/D C/D' D' HL, or C/D 
C/D' D' HKL with a pharmaceutical carrier. 
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4. 



A method of making a medicament for the 



treating of muscle cells of a mammal, said method 
comprising admixing a polypeptide defined by the formula 



segments shown in Fig. 30 (SEQ ID Nos. 132, 134, 135, 
137-139, 156); wherein w comprises polypeptide segment F, 
or is absent; wherein Z comprises polypeptide segment G 
or is absent; and wherein X comprises polypeptide 
10 segments C/D HKL, C/D H, C/D HL, C/D D, C/D' HL, C/D' 

HKL, C/D' H, C/D' D, C/D C/D' HKL, C/D C/D' H, C/D C/D' 
HL, C/D C/D' D, C/D D' H, C/D D' HL, C/D D' HKL, C/D' D' 
H, C/D' D' HL, C/D' D' HKL, C/D C/D' D' H, C/D C/D' D' 
HL, or C/D C/D' D' HKL with a pharmaceutical carrier. 

15 5. The method of any one of claims 1-3, wherein 

50 N-terminal amino acids are cleaved from said peptide 
comprising the £ sequence (SEQ ID Nos. 133 and 159). 

6. The method of claim 3 or 4, wherein X is C/D 

HKL. 

20 7. The method of claim 3 or 4, wherein X is C/D 



8. The method of claim 3 or 4, wherein X is C/D 

HL. 



WBAZCX 



5 



wherein WBAZCX is composed of the polypeptide 



9. 



The method of claim 3 or 4, wherein X is C/D 



25 



D. 



10. The method of claim 3 or 4, wherein X is C/D' 



HL. 
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11. The method of claim 3 or 4, wh rein X is c/D' 

HKL. 



12. The method of claim 3 or 4, wherein X is C/D' 

H. 



5 13. The method of claim 3 or 4, wherein X is C/D' 

D. 

14. The method of claim 3 or 4, wherein X is c/D 
C/D' HKL. 

15. The method of claim 3 or 4, wherein X is C/D 

10 C/D' H. 

16. The method of claim 3 or 4, wherein X is C/D 
C/D' HL. 

17. The method of claim 3 or 4, wherein X is C/D 

C/D' D. 

15 18. The method of claim 3 or 4, wherein X is C/D 

D' H. 

19. The method of claim 3 or 4, wherein X is C/D 

D' HL. 

20. The method of claim 3 or 4, wherein X is C/D 

20 D' HKL. 

21. The method of claim 3 or 4, wherein X is C/D' 

D' H. 
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22. The method of claia 3 or 4, wherein X is C/D' 

D' HL. 

23* The method of claim 3 or 4, wherein X is C/D 1 

D' HKL. 

5 24. The method of claim 3 or 4, wherein X is C/D 

C/D' D' H. 

25. The method of claim 3 or 4, wherein X is C/D 
C/D' D' HL. 

26. The method of claim 3 or 4, wherein X is C/D 
10 C/D' D' HKL. 

27. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising FBA 
polypeptide segments having the amino acid sequences 

15 shown in Fig. 30 (SEQ ID Nos. 132, 134, 135) with a 
pharmaceutical ly acceptable carrier. 

28. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising FBA' 

20 polypeptide segments having the amino acid sequences 
shown in Fig. 30 (SEQ ID Nos. 132 , 134, 136) with a 
pharmaceutical ly acceptable carrier. 



29. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
25 comprising admixing a polypeptide comprising FEBA 

polypeptide segments having the amino acid sequences 
shown in Fig. 30 (SEQ ID Nos. 132 , 135, 159) with a 
pharmaceutical^ acceptable carrier. 
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30. A meth d of making a m dicament for th 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising FEBA' 
polypeptide segments having the amino acid sequences 
5 corresponding to polypeptide segments shown in Fig, 30 
(SEQ ID Nos. 132-134, 136, 159) to muscle cells with a 
pharmaceutical^ acceptable carrier. 



31. A method of making a medicament for the 
treating of muscle cells of a mammal said method 

10 comprising admixing G6F2 polypeptide with a 
pharmaceutical^ acceptable carrier. 

32. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a compound which specifically binds 

15 the plSS*'" 2 receptor of muscle cells with a 
pharmaceutical^ acceptable carrier. 

33. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising EGFL1, 

20 having the amino acid sequence shown Fig. 37 , Seq. ID No. 
150, with a pharmaceutical^ acceptable carrier. 

34. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising EGFL2, 

25 having the amino acid sequence shown in Fig. 38, Seq. ID 
No. 151, with a pharmaceutically acceptable carrier. 
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35. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising EGFL3 , with 
the amino acid sequence shown in Fig. 39, Seq. ID No. 

5 152 , with a pharmaceutical ly acceptable carrier. 

36. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising E6FL4 , with 
the amino acid sequence shown in Fig. 40, Seq. ID No. 

10 153, with a pharmaceutically acceptable carrier. 

37. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide comprising EGFL5, with 
the amino acid sequence shown in Fig. 41, Seq. ID No. 

15 154, to muscle cells, with a pharmaceutically acceptable 
carrier. 

38. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a polypeptide, comprising EGFL6, with 

20 the amino acid sequence shown Fig. 42, Seq. ID No. 155, 
with a pharmaceutically acceptable carrier. 

39. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 35 kD polypeptide factor isolated 

25 from the rat I-EJ transformed fibroblast cell line to 
said muscle cells, with a pharmaceutically acceptable 
carrier • 
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40. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 75 kD polypeptide factor isolated 
from the SKBR-3 human breast cell line to said muscle 

5 cells, with a pharmaceutically acceptable carrier ♦ 

41. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 44 kD polypeptide factor isolated 
from the rat I-EJ transformed fibroblast cell line to 

10 said muscle cells, with a pharmaceutical ly acceptable 
carrier. 

42. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 45 kD polypeptide factor isolated 

15 from the MDA - MB 231 human breast cell line to said 

muscle cells, with a pharmaceutical ly acceptable carrier. 

43. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 7 to 14 kD polypeptide factor 

20 isolated from the ATL-2 human T-cell line to said muscle 
cells, with a pharmaceutically acceptable carrier. 

44. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 25 kD polypeptide factor isolated 

25 from activated mouse peritoneal macrophages to said 

muscle cells, with a pharmaceutically acceptable carrier. 
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45. A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a 25 JcD polypeptide factor isolated 
from bovine kidney to said muscle cells, with a 
5 pharmaceutically acceptable carrier. 

46* A method of making a medicament for the 
treating of muscle cells of a mammal said method 
comprising admixing a ARIA polypeptide to said muscle 
cells, with a pharmaceutically acceptable carrier. 

10 47. A method of making a medicament for the 

treating of muscle cells of a mammal said method 
comprising admixing a 46*47 kD polypeptide factor which 
stimulates 0-2A glial progenitor cells to said muscle 
cells, with a pharmaceutically acceptable carrier. 

15 48. A method of making a medicament for the 

treating of muscle cells of a mammal said method 
comprising admixing GGF-III to said muscle cells, with a 
pharmaceutically acceptable carrier. 
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49. A method of making a medicament for th 
treating of muscle cells of a mammal, said method 
comprising admixing with a pharmaceutical ly acceptable 
carrier , a DNA sequence encoding a polypeptide of the 
5 formula 

YBAZCX 

wherein YBAZCX is composed of the polypeptide 
segments shown in Fig. 30 (SEQ ZD Nos. 133*135, 156, 
159); wherein Y comprises polypeptide segment E, or is 

10 absent; wherein z comprises polypeptide segment G or is 
absent; and wherein X comprises polypeptide segments C/D 
HKL, C/D H, C/D HL, C/D D, C/D' HL, C/D' HKL, C/D' H, 
C/D' D, C/D C/D' HKL, C/D C/D' H, C/D C/D' HL, C/D C/D' 
D, C/D D' H, C/D D' HL, C/D D' HKL, C/D' D' H, C/D' D' 

15 HL, C/D' D' HKL, C/D C/D' D' H, C/D C/D' D' HL, or C/D 
C/D' D' HKL, said DNA in an expressible genetic 
construction • 
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50. A method of making a medicament for the 
treating of muscle cells of a mammal, said method 
comprising admixing with a pharmaceutically acceptable 
carrier, a DKA sequence encoding a polypeptide of the 

5 formula 

WBAZCX 

wherein WBAZCX is composed of the polypeptide 
segments shown in Fig. 30 (SEQ ID Nos. 132, 134, 135, 
137-139, 156) ; wherein W comprises polypeptide segment F, 

10 or is absent; wherein Z comprises polypeptide segment G 
or is absent; and wherein X comprises polypeptide 
segments C/D HKL, C/D H, C/D HL, C/D D, C/D' HL, C/D' 
HKL, C/D' H, C/D' D, C/D C/D' HKL, C/D C/D' H, C/D C/D' 
HL, C/D C/D' D, C/D D' H, C/D D' HL, C/D D' HKL, C/D' D' 

15 H, C/D' D' HL, C/D' D' HKL, C/D C/D' D' H, C/D C/D' D' 
HL, or C/D C/D' D' HKL, said DNA in an expressible 
genetic construction. 

51. A method of making a medicament for the 
prophylaxis or treatment of pathophysiological condition 

20 of the musculature in a mammal in which said condition 
involves a muscle cell type which is sensitive or 
responsive to a polypeptide as defined in any one of 
claims 1, 3, 4, and 31, said method comprising admixing 
an effective amount of said polypeptide with a 

25 pharmaceutically acceptable carrier. 

52. A method of making a medicament for the 
treatment of a condition which involves muscle damage in 
a mamma 1, said method comprising admixing an effective 
amount of a polypeptide, as defined in any one of claims 

30 1, 3, 4, and 31 with a pharaceutically acceptable 
carrier. 
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53. The method of any one of claims 1, 3, 4, and 
31, wherein said medicament is for decreasing the atrophy 
of said muscle cells. 

54* The method of any one of claims 1, 3, 4, and 
5 31, wherein said medicament is for increasing the muscle 
fibers present in said mammal. 

55. The method of any one of claims 1, 3, 4, and 
31, wherein said medicament is for increasing muscle cell 
survival in a said mammal. 

10 56. The method of any one of claims 1, 3 r 4, and 

31, wherein said medicament is for increasing muscle 
growth in a said mammal. 

57. The method of any one of claims 1, 3, 4, and 
31, wherein said medicament is for increasing muscle 

15 regeneration in a said mammal. 

58. The method of any one of claims 1, 3, 4, and 
31, wherein said medicament is for stimulating muscle 
cell mitogenesis. 

59. The method of any one of claims l, 3, 4, and 
20 31, wherein said medicament is for increasing 

acetylcholine receptor synthesis. 

60. The method of any one of claims 1, 3, 4, and 
31, wherein said medicament is for treating a patient 
lacking a neurotrophic factor. 



WO 94/26298 



PCT/US94/05083 



- 164 - 

61. A method of claims 1, 3, 4, and 31 f wherein 
said medicament is for treating a muscle cell which is a 
myoblast. 

62. A method of claims 1, 3, 4, and 31, wherein 
5 said medicament is for treating a muscle cell which is a 

satellite cell. 

63. A method of claims 1, 3, 4, and 31 wherein 
said medicament is for treating a muscle cell in skeletal 
muscle. 

10 64. A method of claims l, 3, 4, and 31 wherein 

said medicament is for treating a muscle cell in cardiac 
muscle. 

65. A method of claims 1, 3, 4, and 31 wherein 
said medicament is for treating a muscle cell in smooth 

15 muscle. 

66. A method of claims 1, 3, 4, and 31, wherein 
said medicament is for treating a muscle cell in a 
patient with a skeletal muscle disease. 

67. A method of claim 66, wherein said skeletal 
20 muscle disease is a myopathy. 

68. A method of claim 66, wherein said skeletal 
muscle disease is a dystrophy. 

69. A method of claim 68, wherein said dystrophy 
is Duchennes muscular dystrophy. 
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70. A in thod of claim 68, wherein said dystrophy 
is Beckker's dystrophy, 

71. A method of claim 66, wherein said skeletal 
muscle disease is a result of a neural condition. 

5 72. A method of claim 66, wherein said skeletal 

muscle disease is an injury. 

73. A method of claim 66, wherein said skeletal 
muscle disease is resulting from a nerve injury. 

74. A method of claim 66, wherein said skeletal 
10 muscle disease is resulting from a neuropathy. 

75. A method of claims l, 3, 4, and 31, wherein 
said medicament is for treating a muscle cell in a 
patient with a cardiac muscle disorder. 

76. A method of claim 75, wherein said cardiac 
15 disorder is cardiomyopathy. 

77. A method of claim 75, wherein said cardiac 
disorder is ischemic damage. 

78. A method of claim 75, wherein said cardiac 
disorder is a congenital disease. 

20 79. A method of claim 75, wherein said cardiac 

disorder is cardiac trauma. 

80. A method of claims 1, 3, 4, and 31, wherein 
said medicament is for treating a muscle cell in a 
patient with a smooth muscle disorder. 
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81. A method of claim 80, wherein said disorder 
is arterial sclerosis. 

82. A method of claim 80 , wherein said disorder 
is a vascular lesion* 

5 83. A method of claim 80 r wherein said disorder 

is a congenital vascular disease. 

84. A method of claims 1, 3, 4, and 31 r wherein 
said medicament is for treating a muscle cell which has 
insufficient functional acetylcholine receptors. 

10 85. A method of claim 85 wherein said muscle cell 

lacking sufficient acetylcholine receptor is a muscle 
cell in a patient with myasthenia gravis. 

86. A method as claimed in claim 84, wherein said 
condition involves muscular damage. 

15 87. A method of making a medicament for the 

prophylaxis or treatment of a muscular tumor in a 
patient, said method comprising admixing an effective 
amount of a substance which inhibits the binding of a 
factor as defined in any one of claims 1, 3, 4, and 31 to 

20 a receptor therefor with a pharmaceutically acceptable 
carrier. 

88. A method of making a medicament for treating 
a mammal suffering from a disease of muscle cell 
proliferation, said method comprising admixing an 
25 antibody which binds to a polypeptide of any of one of 

claims 1, 3, 4, and 31 with a pharmaceutically acceptable 
carrier. 
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89. A method of identifying a nucleic acid 
sequence coding for a molecule having muscle cell 
mitogenic activity, said method comprising contacting a 
cell containing sample with a muscle cell mitogen 

5 specific antibody to determine expression of said mitogen 
in said sample and isolating said nucleic acid sequence 
from the cells exhibiting said expression. 

90. The method of claim 31, wherein said 6GF2 is 
human recombinant GGF2 . 

10 91. A method of stimulating myogenesis of a 

muscle cell said method comprising contacting said muscle 
cell with a compound which specifically binds the 
pl85* rbB2 receptor of muscle cells. 
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L VV4XHCV Vf T CKflC'OlEi' 

CTCCTOCTTCCCATCA Iti M T C T C CTCCTCT ACrCCAAAACCAACAAACAAOCCAAAAAa ma 

ii inn iiiiiiiiiiiiiiiiiii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 if 1 1 iif nifiijif 

Ctt<Jt«tCflgC4tC4tqtfftgtflrt««Ct4CtgC««*4CC«4g444C490ji4*4Mf 



lit 



LIOlLlQfLtf IBirillfi' 

crrcATCAcccocTrccccAGAaccrrcccicriuACA^ 

ii iiiinit 1 1 1 f 1 1 1 1 1 1 T 1 1 1 f i f 1 1 1 1 1 1 1 nun iiiiiimii iii 

ctgc»tg*cctytcttctjgc«g4gccttC9gtctg44cg«44cj4t4tg4tg44C4ttgce 
AACCgCCCCCACCACCCCAAtCC0CCCCCCaXC4ACCT0ai0CT^ ua 

II Hill Mill II 11 II lllllllllll H lllfflllfff fffl 1 1 fff l 

44tgggcctC4CC4tcct44CCC4c«cccg4g*4tgtcc4gctggtg44tC44t4cit4 
t iitx ss'bix'v t ItAI «* tfi' 

TCTAAAAATCTCATCTCtAOCCACCXTATTCTTCACACAGAeCCCCACAflCTCTTTTXCC ]«• 

1 1 1 1 1 1 1 1 Mllllll II 1 1 If III 1 1 1 Jill IIMII II (III MNlRT 
tct««A«acgtc«tctccagtg«gc4e«tt?ttg4g4gtg4«gc«g«g4C4tccttttec 

t siTffr'Aii*srrTr«r*p«i* 

ACCIieTCACrACACTTCC^CXCCTOTCATTCCACTACTCTCACTCA^CTCCCXeTCAC )fd 

lllllllllll Mill Hill Hill I Mil IIIJIIIII If If fff f || HI 
4ce4gtc4ct4Ucttee4C49CCC*teactcc4ct4ctgtC4eee49«ctect4gce4« 

s vfiair'tt x'x • i t i j ?* x ti' 

AOCXC^ACOATCCACACACTCAAAC^TCATTPCCCAMOCCkCTCTCTCATCCTCA^ <J0 

lllllllllll lllllllf llllllllll Ml f fff fff fff If ff IIHflfff 
«gctgg*gc4*cgg»c»C4ctg«tigc4tcctttccg*4*gcc4ctctgt44tcgtg*tg 

TCArccCTAgAAMaurrAOCQacaicccccACTCflccocccc^ <i« 
IIIIIIIIIIIIIIIMIIIIIIIIIIIIIII lllllllllll lllllllllll III 

tc*tcegtagit««C4gt4ggc4C4gc4gcce««ct9ggg9ccc««9«^«e7tctt4«t 

OLOOrit CII F LBIAt|*f»o* 

CCCTTCCCACCCCCTCCTCAATGTAACAGCTTCCTCACCCATGCCAeACAAACCCCTCAC S40 

iii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f f f 1 1 1 1 r 1 1 1 1 1 1 1 j r r 

99C4c«99499Ccete9t9«At9t««ea9Ctteetei99ettqcca9«f«««ccect9tt 
S Y A 0 J > X S t * 

TCCTACCCACACTCTCCTCATACTCAXAC S«t B AD ORIGINAL 

M"!'i|Mi: 1 1 1 I I M 1 I f I I I I I I 
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coexxc jkxdtt am a mt a?) FIGIT Y 30 bi S 

^ € V ^ J 

CO^cTrrCCCCXACTCATCTTACAKTTCTT^CAn^CCCAlTW 
QbfXTlLtlllZFIVA I f | | 

fiACCCCTTCCCCTTUqCAAa 

f t ff F tr C 1 

coozxa stoaxT Lt (so} a k>i 143) 



ft«t9t«te«9«c«(9«cc4ccee^9cte9t4t9te«ectgt«^tttec«Me9ce««f 

CTCCCCCAACTCACCCCcrrCOCAMTyrCCCcqcC Wre TCCACCAC^ 

1 1 1 1 1 1 f 1 1 II MltllllllllNNI.il 1 1 III 1 1 1 1 1 1 1 1 irniMlflf 

FJR*A VS ? f T t" 1 ll'ftLtTf,* 

QCCCTCCATCOCC6tCAC yxv r KC tOCAACACCACACA C C C C TOC r CcmW Ac6« lift 

111 iiiniiiiiiui 1 1 1 111 if 1 1 1 1 1 ilium 11 1 7 Tin mi 

1CCttCC4tq9C^gtCMCCCCttCJtq91«f4*7«9» 9 4CCtCt4CttCtC^tj«C4ei 

rib'iii«TDi'iAQ'Qriiff C ' 

ACCACC6CTCCC6CACAAd. . .?AKACO<^<XCCOaOUtTTOUCTccn«OCTC lift 

(III 1 1 1 1 1 1 1 1 1 1 1 1 II Mill I III Mllll II I IIMmil J4# 

«ec««9«ct9C49949«H«aftUf»«c«te4cccte«^c«Ytte«4ctcettee««e» 

IF ■ f • g 

B »A*I ttfilt*FtF a fttXf|i|* 

9^CCCC6CCCAfCA6ACCAA C MC CT0 CCCCC CA 0CCCCtt^^ ]M 

1 11 111111111111 11 mi 111 11 1 injUMNiiiiiimiiiiiiiiii 

c«4eceeqc9C«t|tc««U«c«KCtceet|«t44ececit«4M«u«t««4«4Mt 

IYt*tfOtTI»*AOt*»Vlitff|* 

GCAATATCAAACCACCCAOCACTACCAACCAacfCAACACCCGCTTAACAAACTGlceAA if 

111 11111111111111 1 1 1 1 1 1 1 1 mil 1 1 1 f J rti 111111111111 nil 

99«7tat9444C94cee««f49t4C)4^ee«4«ee44949cetgtt44f444ete9ce44 

A 

S S * R A X Iff '** c n't A I i lcr' 

CACCACCCCCCCCCCCAAAACAACCAACCCCAATCCTCACATTCCCCACACCnCCJUUT OB 

1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ' 1 1 1 f I f I ! ! 1 1 Ml MM Mil!; . 
. . C«9ceqqc999ee««*«9** ee4 Mcee*«tq9ce«e«tt9ct««c«9«tt9f««ft 

* v 
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WOM/2G98 32/55 PCT/US94/Q5083 

AC RE 30 7/9 

0 I I 't « a o f I v * s i "ft r t o f i 

ccACJUcAAaacccecrcAacacTjacTa»aeccAJuacAc«AfcjuiaM 
Mill lllltll II I I IIIIIIIMIIIIIIII llllllll IJIimim 



411 



«tOT»F bAt« ■'trAAfltAl' 

ACCACAAGATACOCCmeCTCCCaiTACACJaCCCecrCCCACCCAOTCTCCACCCetf u* 
Ml lllllllllllllllllll IIIIIIIIIIIIIIIIIIIIIIIIIM 1 1 JJIJ 

* f 

PAr'tivoitr' iff 

CCCI0CCmi;< X CT C CTCCACACCACCACrMCCCUCACCC0« ClTCI^ l Kw l 0O ^ <M 

1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 f 1 1 1 f f ff 1 f 1 1 1 1 Hill III Mil I Hill # 
•cctqccttccqcct(jflct<jic»<jc»<j«j«ct««ccc4j<«fl9cc^cttctoytc*c«ffi 

IL«*AtlSOTX*Alt'ofXA?« 

AC AATTOCAOOCOOflCfCTCCOCyCTAATCOCf AACCAACACCCTAfC CC f G f V T A AAA ««• 

MM I lllllllllll II lllllll lllllllllllllllll IIHI IMJI 
Mt««tee«^cc«9fct9tet««t9t4*tt9Ct«ftce««9tccctattfct9t«t«Mt 

CCCAAATACACCCATACATTCACCTCTAAMCinAfTrtATAfMTAAA^AfTCQ^ Tift 

it inn ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 fiii iiiftiiifiiffiffii i rTifff 

«cta««t«««c«e««a9«tte«cet9t«a«««ttutut«t«t««t««««t«tteeMt 

rTAAATTAAACU »JJ 

iiiiiiiiimi 
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33/55 

FIGURE 30 3/8 



A W w * ¥ 

?T¥T 3 S*TTTa^^ ,4. 

aafaa«f»» t TtTiiltxiiT 

« * v t C « I c '«» 
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(5«, t> «, , 4 V) FIGURE 31 '/i 

Mflim Auclootldo Monaco «nd 4#4uch prettlA ittuinco 
ACTTTCCCCCCCaACTTCTCCCXACTa^ l# 

cecccctcc^eeccAtc^cccccccccccAcccTUt^ XJi 

TCCCACCCC^CCCACCCA^CA6Caa^0CC<ACe«CC<CC«CAACCCAeaCTCe no 
CCACCC*C?CCC(^C«C<a«<XACtXCC^Gn^ j 4# 

ACTCCCACCTCreCCC^CCOCA lU 'I WWCCCU ^ ^ )0# 

QCTCCCCCCOC0CCCCCCCC0CX , TCCCCCCt>rK )ft 

AMCrrrtCCCCAACCCCAkcCACCCCTCCCACCCAMC^ 4S9 

CCCCACCCCTCCCCCCACAGCCTOCACTTCTCC<MCCACATCrCCCACCCCACAC^ 410 

■ i t I i f f 

AAACOCAACGCCA A CgXOCCAA fl AACCACCCACC^ S4d 
eCTG6C0QCCC6A0CCCAGCt.TlW<.tlCcCwwC I'lUAAAGACAtCAACACTCACCACTCT 100 

A««»itAtf'ii»isiitaii 

CTQCCAOCmCAAACTAoiwnCCCt^ACACCAOTTCrCUTACTCCTC^T^^ MO 
TAaiKLTltCttlttTfflf 

WJUCTecfrCAACMW^CTCAArtAAOCC C AMCAA 7J0 

f ivrti«iibi iiit»«fii 

ATACACAA UG OCCCCCCAAflTCAgAACricOCACTAC 710 
X OKItatfllbB ZflAflAOf 

<^<^TATAlWXl)U^CrUTa«CAAA | 40 
4 ITICK?tlIL4IOSA IAIX 

ACCAITWOOAQTCAMCCAaiacCACrCCCATCCCACC^ 900 
T tTIOIISft«l»AIT!flT 

QtU tCrfCAflACTCICCCATtAfiAAtATCACTATCAAOAAAOftMeAAA 900 

TCCACATCCACAtCTACACCTMCACAAOCCATCTrGTCAACTCTXACAC^ 1910 
• T • t • f A • T • ■ LTICA1ESI 

ACTTTCICTCTGAATOCACOCCACTCCTr^TCCTCAA^ 1010 
T reTf 4«t€FKVg»tfl tft 

TACTWTCCAACTOCCCAAATCACTTTACTCOTCATCCCT^^ 1240 
-Y'LCKCPIt't* OtCOHTVIA 

ACCTrCTACACTACCICCAaCCCTTTCTCTCTCTCC«3A>tACC«circaaCTC« 1200 

jrrr«srsrrrLSLPf« 

cTccc:crrrrrrc:::rc:cA:cT:rrrrrACArr:rA:rTA:ACC7A:A:cccrrr:i i;«o 

7 - . . T TT .V T- s 7 7 T : T 7 • * • • •••••• • - •* 4 • +\ . . * . X \ .Z . 

\ 
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FIG, AE 31 35/55 

TCACTrCC TC T CTCCu f UC 



CTA«TC«CTWA^ACrcCTACCT^JUC<x Tcc ^ 



13l# 

CAATCACAATAAACCCCTTCAAAACTCTCACTTTTAn^A 



TCTrrcKAAArwAwnuAnAcwTCATActaat^ 



^AAAAATCCTTCCAC MM 

<^A<*cTcccrmcmAT^ ^ 

TAACCACTAaCACTTCAAATCATCCTAAClTCCCT^ HJ| 
ACAAATAMaCAATAAAAAAAAAAAAAAAAAA Iff} 
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(JO? u go: |45* 

FIGURE 32 

coram] ftuclmld* «M d«uct4 protiiA i«^ #AC# 

^mctccrrc^ J|# 

CAC^CCCCA^ 

AAC4AeAAAAGTTTCT6tcfGAATCCACCCCAGT6CnCATCC^ IM 

iiitrcTia«teri?to Lit 

SCCTCAAttfAcmTKAjtfTeCCAACCrceAira ... 

'■tTLcicgrcrrtAic t t i M 

??^eATeAAtt?CCAMCCaUlCAAA^ fM 
CAA^CTAWJUtCOCC^qCTICTAaCTACCTCCA CTCCCim > 

^'TTiAtrTirtrrrbi t > i 

yAccceATcWA C Tc cc T c cc c ctrrcr w rr c ccocAt^cccT ^ , 00 

AGCTAOATGttTmACCAOTCTAACAlTCACTCCCTCX^^ ,|0 
AACACAACCCATT6TAtGACTTCCTCTCTCCCTCACTACTC66CTCTCA©CTACTCCTAC 1010 
CTCCCTAACCCTCCACTCTTrCTCAJUTTC ATCtTCAA1TACT6TCAT*CCACATCATAC 1 Of 0 



me 
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<m»»: w« FIGURE 33 i /a. 

0473 MM micltotld* toluene* «nd 4#4uct4 prottiii i4j*ju«nc« 

CAACTaCAACTKCameOUUCCCraCTGC^ 
ISttlXSKAf LADSatfH « |[ 

ACTCATCAOauaCTAGGAAATCACAei^CTCTCCOU^TCACam ... 

vijitcwosAJAixt:?f j^J* 

CCCCAaTC^CATCTACACCTC<^CJUkCCCATCmT^ , 1A 
ATStSTAarSILTICAlI | g 1,0 

AACTTrCTCTCTCAATCGACeCCACXGCn^TGCT6AAA^ 
TrCTieODCFMTIOtflF j ,f Mf 

ATACrTCTe^CTCCOUCCTCCA1TCACTeCAC<CA6ATCTACr^ , flfl 
V tCIC*»«rT4AlCftlVfa "' 

<MAACTCCJuL\CCCJaCAJUUUCCTXACCA JM 
KVQT4tKAtf LTOIIflt r ? 

CCOCAmgCATCCCOCT G CTC Ore TrOSCATCA W tCW Ofi 
0XC XAttT?4XICf??f C I? 

<UACAMCAACCCAAAAA<CTrCAtCACCCOCTTCCC^ 4tA 
Bit) lilt I»ltl«ttlt| i» w 

CACCATOAI^CCTAOCCJUkCC^CCCACCACCCCAAWCC^ 9(A 
THH ITAiatl I f» * * * I I T Q 

CCTCCTCAATCAATACGTATCrAAAAATCTC^TCTCTACCCACCATAtT^ <00 
LTIQTTSKI* X • • I ■ X V I | f 

GccGCACAecTCTrmccAccACTatcTACA^^ Me 
AttfriftifrfYAiitrr? 

C^CTCAGACTCCOGTCACACCTCCACCAATCCACACACTGAAACCATCA 720 

TQfrsiffsiair ititttt 

CCACTCra^TCCTOATCrCATCCGTA<aU^ 7|« 
■ • T X VI • #?■■•■■•• » t • f 

CCCgAOl^CCTCTCAAWCCrTCCCACCCVVfCt; rCAAtVrJUUACcfTCCTCACOCA I4f 

»i4iti«t4« r i i c i i r b t i 

TCCCACACAAACCCCICACTCCtACCCACACTCrCCT C ATACWAMCA^TA^ >00 

Aiiff»iriii ri t i i i i l t 

AGCTCA<XTAACCACAJUkCUC«COCACAT^^ 1(0 
AILAIIIAI1S KCROt CjLlA 

AACTCATCTTACAOCn^TTCCATTCCCCATTCCCCrrCATTCTCTAACACCCCTTCCCC 10>0 
TMlftASSXrXVAS f S I T > V » 

T7tAqCAACCTATCTATCACCAATCACCACCCCCCCT«TAWTCACCTCTACAmCCA :5ic 
LC It T V $ A « T T P A * HS rVDfN 

• • * • • » 

CAeccsAA5cr:cc:cAA«T:ATCfsc:TcccAAATCTcr :«:ccj::tccaccac3ac : • t : 

BAD ORIGINAL 



W0 940GM PCT/US94/05M3 

FIGURE 33 A/X 38/55 

CTXAAACCC^T»C*CCCAI»«ITOCTOT*AMCI^*rrrrATATJUTMA«T»T 174, 
TCCACCTTXXArTJUkACJUUUUUa 17(4 
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FIGURE 34 



• t 



QS£2^KCAEKBCTFCVr«GeCFWNKDL$N^ 1 
b£fi£ eClfllCYKDFCIH-CECJCtVKQ-RAW— CKCQQSYFGERCGEKSNK1HS' 



1<S« IB WOt m> 
*<S«Q IB NOt Wfi) 
5 ($«P IB Ml. I*}) 
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FIGURE 36 



F-l-4' 



f-b-*-<-«/b-o 

f-|-JL-<-C/0-l 

F-l-L-<-</>I-l 

F-l-A<«</D-t-M. 

F-t-*-«/tf-o*-i . 
r*t»i«c-</o-o*-B>t> . 

F-l-A-<-e/&'-P 
?•••!«/•••■ 

r«t-A-<«</»'«i*t 
F-B-i-<-€/&'-i-f-t 

F-B-l««/D»-D'-I 

F-B-A-<-</9'»D'-I-l» 
f-l-A-<-«/»*-0'-t-M. 
F-B-A«*-</«-W-B 
F-B-*-<-</8-«/8'-I 
F-»-*-C-e/D-C/8' -I-L 
T-*-k-C-</X/f -t-I-l 

F- B-&-C-C/X/B ' -0' -i-t 



7-B-f-A' 

F-I-»-A-<*</>I-t 

f«t-t*l-«/D-0'* 
F*t-B-l««-</B>t'*f*t 

F-t-B-l-«rc*-# 
F-t-B-*-«/9'-I 

r-t-i-x-c-</o'-i-t 

F«I»t-*««yO'«MM 

r r C-t-l-C</O'-0'*t-fc 

F-fB-k-C-C/X/B'-f 
f-t-i-*-C-C/0-</B'»I 

F- I-i-l-C-W<yB' -I-f • t 
F-t-t-*-«/©-W-*'"« 
F- 1 « l-l-^/D-VD* -0* 
F-t-t-l-^-C/X/B*-^ -I-I-fc 



?- 
F- 

F- 
F- 
F- 
F« 
F' 
F« 
F« 
f« 
F« 
F« 

r 
r 

F 

r 

F< 

r 
r 

F 



B-i-e-e-c/B-i-t 
i-*-«-c</o-i-i-l 

B-X-4-C</D-0»-B 
B-l-4«<^B-0'-l*t 
»-l-C-e</>0' -I- i-t 
•-*««-«-«/»'•• 

B-&-4-«/ft'-t 
B-*-C-<-</B'-I-t 
B- 

B*l-C«C-</B'-0'«l 
-B-A-«-C«</»'*0*-t-*i 
•B-l-4-€-^B* 
•B-X-«-C-</t-VO»-» 
•l-A-^-C-^/X/B'-I 

.B-*-4-«yB-<y»*-B # *« . 
-i-A-O-C-C/t-C/B'-O'-I-fc . 



F-t 
F-t 
F-t 
F-t 
F-I 
F-t 
F-l 
7-1 
F-t 
F-t 
F-t 
F-I 
F-t 
F-I 
F-t 
F-I 
F-t 
F-t 
F-t 
f-t 
F-t 



■t-*-d-<«</0-t 
►t»JL*<-«</D-t«I-fc 

•H-«-C-CI/»-0'-I-t-t 
>t-&-4«/D'«l 

»B-A 0 C W'+ 
••-i««^0 # -f-I-i. 
-t-fc^-C-C/»*-0»-I»M 

.|-jL-«-c-cy»-cy»'-» 

l«l«<-C-€/»-VB'-t-t-Xi 

-•-*-«-c<^>^»'-o # -t 

>»-A-4-C</X/&'-D'-B*b 
«l-A^-«/X/0'-e*-l-t-6 
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FIGURE 36 
QOT/uneou* irucnte vwnjum oommo 
!•»•*» , 

I-l-i-«/>l 

I-I-l-€-</>0'-I-X-l, 
X*t-L«C</8'«l«t 

*-&-A-«/X/D'-I-t 
«-l-l-C-C/0-VD' -0' -I-L 



t-a-i-c-e-c/o-i-i 

l-l-A-C-C-C/O-O'-I-L 

t-t-l-O-C-C/V-l 
X-t-K-«/&'-I*t 

t-s-A-«-c-<i/x/o'*o'*i-K«t 
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WfU 43/55 



JGCCATCTOTCAACTCTCttCAC^ 

^^raTCAAACACerTTCAAATCCCTC^ 

ACraTCATCCCTCCOUUWCTACOTMT^ 

tCORCQMTVNASrTfTl f f , 



CTCTCTCTCCCTCAATAC 
L J L F t • 



(saq u *>» 



FIGURE 37 
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ecru 44/55 



ACCaTCnOTCJUMTStCaOACAACCAGAAMCm 

sNivKCAticirrcvfaag c 

TTCATCOTCWUCACCTTTCAMTCCCTCJ^ 

fWVKOLIIfSRYLCfCOfCf 

ACTCCA0CCACATCTAC?CA6AATCTeCCCATCAMGTC^ 

6AGCTCTACTAA 
t I Y • 



(stq o mt is/) 

FIGURE 38 



BAD ORIQ/NAL 



WO 9406298 
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* * I t L T 



W U lOt 112) 

FIGURE 39 
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ecru 



46/55 



PCT/USW/05083 



AGCCATCTT6TCAA6T6T0CA0A0AA0CACAAAA I ITltTb IV WAAT0CA66CCAtf*s; 
SHlVKCAlttXrrCVH T «f c 

ACTWTOAtCC^^ 



TTTATC6A6AAAGCGCAGCAGCTCTACTAA 
FN IKACttT* 



(StQ U Mi |S3) 



FIGURE 40 
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ICfU 47/55 



• ^ ^ ^ 



ACTCCCTPTCTCTCTCTCCCTCJUTA8 



(StQ IS MO: |5V) 

FIGURE 41 
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48/55 

ACTCSAGCCAGATCTACTCACMTGTGCC^WmCTCC^ 

r C A « C T I I T P H I T Q f Q g c J 8 *!™ 

AATCACmACTCGTCATCCCTCCCAAAACTACCTAATCCC^ 
VCFT601CQI YTRAIFTEA t 

GAGCTCTACTAA 
I i t • 

(JBQ U Mt ISO 



FIGURE 42 



bad or/g/nal 
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FIGURE 43 
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BMltttlfe UA Udft* ««U l«*M*C« •< OVUM! 

qg>CCCC tt ja»JU>CUU, TM ^ 1M 
H I'l l 1 1 r r r ; n 7 ^fff^ WVWM '" i^ieea e ^«Mw a efl fl Q8 ae«a M a e M 3<t 

t n u u i f n t i { * 1 1 1 • 1 

oora-t 



t i S v S uw r t v 1. 1 • s 



<aocaL>oooa*oc coxv/t mm rrr* mi i tuui i > i r i yr i nnncy fl yacm 44% 
wuux ' g»t W 8 c e o tfcwc« e a cTo ecca o oacc a Bc tm 



(SEQ ID NO: 21) 



FIGURE 44 //i 
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I i__ 



« * • • » » A » I 

ovxx-t i corn-u 

0 U U t X T t I I M | i 2 

«nx-it i 09IZ-I 

nrm^ i n irTnn^iTor fnv riyririMi ut m t in trim i w iimimi h * im 
f » t X T rxryitri 



rrrrnriiiiTTriinniniTifirrTi nn i m iitth »»iu hiii' lu u lhuul ltit 

• ttbSSBTtavbestcA iTP 

t l I I I I M I I l t« l I I, ▼ tt« 

▼ I » 
1-4 



■ T • • ■TlSbarBVF 8 ■ • ■ 8 fc 
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